COMPOSITIONS THAT INHIBIT PROLIFERATION OF CANCER CELLS

This application claims priority to United States Provisional Application No. 60/239,705,
filed on October 12, 2000, entitled "Agents promoting apoptosis in cancer cells via interruption of
oncogene-induced integrin signaling," which application is herein incorporated by reference in its
entirety, and to United States Provisional Application No. 60/242,812, filed on October 24, 2000,
entitled "Agents promoting apoptosis in cancer cells via interruption of oncogene-induced integrin
signaling," which application is herein incorporated by reference in its entirety.

I. BACKGROUND OF THE INVENTION 

In cancer cells, multiple oncogenic lesions cooperate in malignant transformation. Such
cooperation permits survival and proliferation of tumor cells in the absence of contact with
extra-cellular matrix (ECM), suggesting that tumor cell survival and proliferation have become
independent of the engagement of integrin signaling by ECM.

Carcinogenesis is caused by multiple cooperating genetic lesions leading to a progressive
deregulation of cellular signaling and cell cycle restriction point control. The mutations involved
result in oncogene activation or loss of tumor-suppressor gene function. Typically, single
oncogenes are insufficient to cause malignant transformation because they simultaneously induce
signals stimulating and inhibiting cell growth. As a result, cell proliferation remains restricted. In
contrast, cooperating oncogenic lesions act in concert to disable such inhibitory signals while
reinforcing the growth-promoting stimuli. The cooperation of oncogenic lesions involves
integration of multiple signals converging on the regulation of cell cycle-dependent kinase
complexes (Lloyd et al., 1997; Perez-Roger et al., 1999; Roper et al., 2001; Sewing et al., 1997).

Disclosed herein are compositions and methods that show that survival of various transformed
cell types requires cell-autonomous (autocrine) integrin signaling activity. This activity is induced
by cooperating oncogenic lesions and involves induction of integrin receptor and ligand
components such as integrin alpha6 and integrin beta4, and laminin5-gamma2 chains. Blocking of
integrin or the laminin ligand function induces rapid apoptosis of the transformed cells, even when
growing in the presence of ECM. In contrast, normal cells remain viable when exposed to the same
treatment.

The disclosed compositions and methods are related to the cooperation of oncogenic
lesions controlling the ability of transformed cells to proliferate in the absence of contact with the
extra-cellular matrix (ECM). As taught herein, oncogenes cooperate to promote a cell-autonomous
(autocrine) integrin signaling loop that proves essential for the survival of various transformed cell
types. As this signaling loop is not established in corresponding normal cells, the signaling
components of this loop constitute attractive targets for cancer therapy.

II. SUMMARY OF THE INVENTION 

In accordance with the purposes of this invention, as embodied and broadly described
herein, this invention, in one aspect, relates to compositions and methods related to
integrin-mediated cancer cell growth.

Additional advantages of the invention will be set forth in part in the description which
follows, and in part will be obvious from the description, or may be learned by practice of the
invention. The advantages of the invention will be realized and attained by means of the elements
and combinations particularly pointed out in the appended claims. It is to be understood that both
the foregoing general description and the following detailed description are exemplary and
explanatory only and are not restrictive of the invention, as claimed.

III. BRIEF DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in and constitute a part of this
specification, illustrate several embodiments of the invention and together with the description,
serve to explain the principles of the invention.

Figure 1 shows a series of schematics representative of the disclosed relationships and
compositions. In normal cells, integrin receptors signal to suppress programmed cell death
(apoptosis) when engaged by appropriate extra-cellular matrix (ECM) ligands. When
receptor-ligand interaction is lost, the cells undergo apoptosis due to the lack of survival signals
(Figure 1, Panel 1). In cancer cells, multiple oncogenic lesions cooperate to cause malignant
transformation. Such cooperation permits survival and proliferation of tumor cells independent of
integrin receptor-ECM interactions. This property has been termed anchorage-independence
(Panel 2). We have discovered the mechanistic basis of anchorage-independence. The transformed
cells replace the requirement for ECM-dependent signaling with a surrogate integrin signaling loop
on which they rely for survival. In colonic epithelial cells, activation of Ras in combination with
APC (adenomatous polyposis coli) or p53 mutations leads to induction of integrin receptor and
ligand components such as integrin alpha6 and laminin gamma2 chains. As a consequence,
laminin-dependent activation of alpha6/beta4 integrin receptors signals to inhibit caspase activity
and thus to suppress apoptosis (Panel 3). Ablation of integrin alpha6, laminin ligand function or
alpha6/beta4 integrin receptor function induces apoptosis of the transformed cells, even when
growing in the presence of ECM. In contrast, non-transformed control cells remain viable when
exposed to the same treatment, indicating that the dependence of the transformed cells on
autocrine integrin signaling may be a particular feature of the cancer cell phenotype. The essential
role of alpha6 integrin extends to various transformed cell types including mesenchymal and
SW480 human colon carcinoma cells. Thus inhibition of laminin/alpha6 integrin-mediated
signaling is an important method to induce cancer cell-specific death in a variety of cell types
(Panel 4).

Figure 2 shows that oncogene cooperation protects cells from apoptosis. Figure 2A shows
control, APCm, Ras, and APCm+Ras cells that were detached from collagen IV substrates with
Trypsin/EDTA and kept in suspension at 2x10^5 cells/ml in RPMI 10% FCS for 12 h at 39°C.
Subsequently, TUNEL analysis was performed on poly-lysine treated slides. The percentage of
TUNEL-positive cells was determined by immunofluorescence microscopy. Figure 2B shows
control, APCm, Ras, APCm+Ras, and SW480 cells that were detached and maintained in suspension
as described in (A). SW480 beta4 dn/Gal4VPER cells express a 4OH-tamoxifen-inducible
dominant-negative form of the beta4 integrin. Cells were pelleted and protein extracts were
prepared in 300 µl of 50 mM Tris-HCl, pH 7.4; 1% NP40; 0.25% sodium deoxycholate; 150 mM
NaCl; 1 mM EGTA. 200 µl of extracts were incubated for 10 min with 2 µl of Caspase 3
fluorometric Substrate (Upstate Biotechnology). Vmax of caspase activity was determined by
measuring the fluorescence at 460 nm after excitation at 380 nm for 1 h. Figure 2C shows laminin
gamma1 and gamma2-specific peptides that were added to APCmin+Ras cells at a concentration of
100 µg/ml. Caspase activity was measured as in (B). Figure 2D shows caspase 8 and caspase 9
activities that were measured as in (B) using caspase 8 and caspase 9-specific fluorometric
substrates (Upstate Biotechnology).
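
As an illustration of the Vmax determination mentioned for Figure 2B, the following is a minimal sketch rather than the procedure actually used. It assumes that fluorescence is read at regular intervals over the 1 h measurement and that Vmax is taken as the steepest least-squares slope over a short sliding window; the function name, the window size and the readings are hypothetical.

```python
def vmax_from_trace(times_min, fluorescence, window=3):
    """Return the steepest least-squares slope (fluorescence units per minute)
    found over any `window` consecutive readings.

    Assumption: Vmax is approximated by the maximum local slope of the trace.
    """
    if len(times_min) != len(fluorescence):
        raise ValueError("times and readings must have the same length")
    if window < 2 or len(times_min) < window:
        raise ValueError("need at least `window` readings, with window >= 2")

    best = float("-inf")
    for start in range(len(times_min) - window + 1):
        t = times_min[start:start + window]
        f = fluorescence[start:start + window]
        t_mean = sum(t) / window
        f_mean = sum(f) / window
        # Ordinary least-squares slope of f against t within this window.
        num = sum((a - t_mean) * (b - f_mean) for a, b in zip(t, f))
        den = sum((a - t_mean) ** 2 for a in t)
        best = max(best, num / den)
    return best


# Hypothetical readings taken every 10 min over 1 h (arbitrary units).
times = [0, 10, 20, 30, 40, 50, 60]
readings = [100, 120, 160, 220, 260, 280, 290]
print(vmax_from_trace(times, readings))
```

A sliding window is used here so that the lag at the start of the reaction and the plateau at the end do not lower the estimate; other definitions of Vmax would need a different calculation.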

Figure 3 shows alterations of integrin and laminin expression profiles in malignant cell
transformation. Figure 3A shows integrin expression and Figure 3B shows laminin expression.
The indicated cell populations were cultured on collagen IV-coated dishes at 39°C in RPMI 10%
FCS. Total RNA was extracted from 10^6 cells for each sample and used for RT-PCR (laminins,
alpha4, gamma2 and GAPDH) or RNase protection. For RT-PCR, cDNA was subjected to 28
cycles (linear range) of PCR amplification. PCR products were analyzed on a 2% agarose gel. For
RNase protection, 10 µg of total RNA was used per reaction. Products were resolved on a 4.5%
polyacrylamide/10M urea gel.

Figure 4 shows that alpha6/beta4 integrin is engaged by the laminin gamma2 chain to activate
Shc (a src homology domain-containing protein). Figure 4A shows cells that were detached from
collagen IV coated dishes with 3 mM EDTA (in PBS) and incubated on ice with the indicated
antibodies for 1 h. Cells were then plated on 96-well dishes coated with gamma1 or
gamma2-specific peptides and were permitted to attach for 30 min in RPMI medium at 39°C. After
incubation, wells were washed with RPMI. The percentage of attached cells per well was measured
by hexosaminidase activity after lysis of the cells in the well and incubation with the substrate
p-nitrophenol-N-acetyl-beta-D-glucosaminide for 5 h at 37°C. Figure 4B shows the indicated cells,
which were detached as in (A) and resuspended at 10^7 cells/ml in PBS. After 1 h of incubation on
ice, cells were pelleted by centrifugation for 5 min at 900 rpm and resuspended in RPMI containing
the phosphatase inhibitor pervanadate at 10 µM. The cells were incubated with beta1 and beta4
integrin antibodies as well as the laminin gamma2-specific peptide for 40 min. Protein extracts
were prepared and subjected to an immuno-precipitation with an anti-Shc antibody.
Phospho-tyrosine was detected with a phospho-tyrosine-specific antibody in IP-western-blots
(upper panels). The levels of Shc protein were monitored with a Shc-specific antibody reusing the
same membrane (lower panels).

Figure 5 shows that integrin alpha6/beta4 and laminin gamma2 chain expression is essential
for survival of transformed cells. Figure 5A1 shows the indicated cell populations, which were
infected at an MOI of 2 with recombinant retroviruses expressing anti-sense RNA for alpha6
integrin, the gamma2 laminin chain or a beta4 integrin dominant-negative mutant together with a
puromycin resistance gene. After 2 weeks of selection with puromycin, colonies were stained with
Giemsa and counted. APCm+Ras alpha6 and APCm+Ras p35 cells express the alpha6 integrin chain
and the anti-apoptotic baculovirus p35, respectively. APCm+Ras cells were also plated on dishes
pre-coated with a laminin gamma2-specific peptide. Figure 5A2 shows APCm+Ras hygro,
APCm+Ras Bcl2 and APCm+Ras p35 cells, which express the hygromycin resistance marker and the
anti-apoptotic proteins Bcl2 or p35, respectively. The cells were infected at an MOI of 2 with
retroviruses carrying the beta4 integrin dominant-negative mutant and a puromycin resistance
gene, or the puromycin resistance gene alone. Cells (10^) were maintained in soft agar at 39°C for
two weeks, after which macroscopically visible colonies were counted. Figure 5A3 shows the
indicated cell populations, which were infected at an MOI of 2 with recombinant retroviruses,
drug-selected and maintained as described in A1. Figure 5A4 shows the indicated cell populations,
which were infected at an MOI of 2 with recombinant retroviruses expressing anti-sense RNA for
alpha6 integrin, the beta4 integrin dominant-negative mutant or the puromycin resistance marker.
Cells (10^5) were maintained in soft agar and evaluated as described in A2. Figure 5B shows the
effect of the alpha6 anti-sense RNA expression on the cell surface expression of alpha6 integrin,
which was monitored by FACS analysis using an anti-alpha6 rat monoclonal antibody and a
FITC-conjugated goat anti-rat antiserum as the secondary antibody (upper panel). Expression of the
beta4 dominant-negative mutant was confirmed by RNase protection. A probe overlapping the
C-terminal end of the beta4 dominant-negative mRNA was used to measure beta4 integrin and beta4
dn expression in the same sample (lower panel). Figure 5C shows arf null mouse embryo
fibroblasts (MEFs) that were infected with retroviruses expressing the oncogenes Myc and Ras.
Cells were then additionally infected with retroviruses expressing anti-sense RNA for alpha6
integrin or the beta4 integrin dominant-negative mutant as shown in (A). MEFs
Ras/Myc/Arfnull+alpha6 sense express the alpha6 integrin chain. Figure 5D shows SW480 human
colon carcinoma cells that were infected as described in (A) using VSV pseudo-typed viruses.
Infected cells were selected with puromycin in soft agar. Clones were counted 2 weeks after
selection. SW480 beta4 dn/Gal4VPER cells express a 4OH-tamoxifen-inducible
dominant-negative form of the beta4 integrin. The cells (10^5) were maintained in soft agar for 2
weeks in the presence or absence of 4OH-tamoxifen.

IV. DETAILED DESCRIPTION 

The present invention may be understood more readily by reference to the following 
detailed description of preferred embodiments of the invention and the Examples included therein 
and to the Figures and their previous and following description. 

Before the present compounds, compositions, articles, devices, and/or methods are
disclosed and described, it is to be understood that this invention is not limited to specific synthetic
methods, specific recombinant biotechnology methods unless otherwise specified, or to particular
reagents unless otherwise specified, as such may, of course, vary. It is also to be understood that
the terminology used herein is for the purpose of describing particular embodiments only and is
not intended to be limiting.

A. Definitions 

As used in the specification and the appended claims, the singular forms "a," "an," and
"the" include plural referents unless the context clearly dictates otherwise. Thus, for example,
reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, and the
like.

Ranges may be expressed herein as from "about" one particular value, and/or to "about"
another particular value. When such a range is expressed, another embodiment includes from the
one particular value and/or to the other particular value. Similarly, when values are expressed as
approximations, by use of the antecedent "about," it will be understood that the particular value
forms another embodiment. It will be further understood that the endpoints of each of the ranges
are significant both in relation to the other endpoint, and independently of the other endpoint.

In this specification and in the claims which follow, reference will be made to a number of 
terms which shall be defined to have the following meanings: 

"Optional" or "optionally" means that the subsequently described event or circumstance 
may or may not occur, and that the description includes instances where said event or circumstance
occurs and instances where it does not.

"Primers" are a subset of probes which are capable of supporting some type of enzymatic 
manipulation and which can hybridize with a target nucleic acid such that the enzymatic 
manipulation can occur. A primer can be made from any combination of nucleotides or nucleotide
derivatives or analogs available in the art which do not interfere with the enzymatic manipulation.

"Probes" are molecules capable of interacting with a target nucleic acid, typically in a
sequence-specific manner, for example through hybridization. The hybridization of nucleic acids
is well understood in the art and discussed herein. Typically a probe can be made from any
combination of nucleotides or nucleotide derivatives or analogs available in the art.

B. Compositions and methods 

Disclosed are compositions and methods related to integrins and integrin signaling. It is
shown herein that integrin alpha6, integrin beta4, and laminin5, through at least upregulation of
the beta and gamma chains of laminin5, are upregulated in cancer cells. Integrin alpha6 and
integrin beta4 interact to form the integrin receptor, A6B4. The integrin receptor A6B4
specifically interacts with laminin5 and laminin5 specifically interacts with A6B4. Furthermore, it
is shown that interference with the production or function of alpha6, beta4, or the laminin gamma2
chain not only prevents proliferation of the cancer cells that depend on the upregulation of alpha6,
beta4, and the laminin gamma2 chain, but also kills those cancer cells. Alpha6 and beta4
signaling occurs through the integrin receptor A6B4. Thus, interference with the formation of
A6B4 will interfere with the function of A6B4, for example, the signaling of A6B4. Thus,
disclosed are compositions and methods that interfere with the function of alpha6, beta4, laminin5,
the laminin gamma2 chain, or A6B4. Also disclosed are compositions and methods that interfere
with the function of molecules involved with the signal transduction that is connected to
alpha6, beta4, laminin5, the laminin gamma2 chain, or A6B4. Also disclosed are methods for
reducing the proliferation of cancer cells, as well as methods of killing cancer cells, that involve
using the compositions disclosed herein that interfere with, reduce, or eliminate the function of
alpha6, beta4, laminin5, the laminin gamma2 chain, or A6B4.

Disclosed herein is a relationship between two types of molecules in a cancer cell. The
first type of molecule is an integrin receptor, composed of integrins, and the second type of
molecule is a ligand that interacts with the integrin receptor, through the integrins. There are
specificities that exist between the integrin receptors and their ligands. One aspect disclosed
herein is that when a cell goes from a non-cancerous state to a cancerous state, there is a
co-upregulation of both the ligand (or parts of the ligand, such as subunits) and the cognate integrin
receptor. The co-upregulation of both types of molecules creates an autocrine loop situation,
wherein the signaling pathways controlled by the integrin receptor become autonomously
activated, rather than exogenously activated, as would normally occur. The upregulation of both
types of molecules creates a more fully transformed cellular phenotype in which cancer cell
survival depends on the autocrine loop. Therapeutic activities can now target both points in the
autocrine loop. Specific examples of this co-upregulation in cancer cells are disclosed herein, for
example, laminin 5 (both the beta and gamma2 chains are upregulated) and integrin receptor
alpha6beta4; laminin 10/11 and the integrin receptors alpha6beta1 and alpha3beta1.

As the Examples herein indicate which integrins, which integrin receptor, and which
integrin ligand are involved in conferring on cancer cells the ability to grow in the absence of
extra-cellular matrix (ECM), also disclosed are methods using these integrins, integrin receptors and
ligands to identify molecules that interact with them and/or interfere with their function.

1. Compositions 

Disclosed are the components to be used to prepare the disclosed compositions as well as
the compositions themselves to be used within the methods disclosed herein. These and other
materials are disclosed herein, and it is understood that when combinations, subsets, interactions,
groups, etc. of these materials are disclosed, while specific reference of each various individual
and collective permutation of these compounds may not be explicitly disclosed, each is
specifically contemplated and described herein. For example, if a particular beta4 or alpha6 is
disclosed and discussed and a number of modifications that can be made to a number of molecules
including the modifications to beta4 or alpha6 are discussed, specifically contemplated is each and
every combination and permutation of these modifications and the modifications that are possible
unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are
disclosed as well as a class of molecules D, E, and F and an example of a combination molecule,
A-D is disclosed, then even if each is not individually recited, each is individually and collectively
contemplated. Thus, combinations A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered
disclosed. Likewise, any subset or combination of these is also disclosed. Thus, for example, the
sub-group of A-E, B-F, and C-E would be considered disclosed. This concept applies to all
aspects of this application including, but not limited to, steps in methods of making and using the
disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is
understood that each of these additional steps can be performed with any specific embodiment or
combination of embodiments of the disclosed methods.
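
The enumeration in the preceding paragraph can be illustrated with a minimal sketch. The class labels A to F are the placeholders used in the text; the variable and function names are hypothetical and serve only to show how the nine pairings arise from one member of each class.

```python
from itertools import product

# Placeholder classes of molecules, as named in the text above.
class_one = ["A", "B", "C"]
class_two = ["D", "E", "F"]


def pairings(first, second):
    """List every combination of one member from each class, e.g. 'A-D'."""
    return [f"{x}-{y}" for x, y in product(first, second)]


all_pairs = pairings(class_one, class_two)
print(all_pairs)

# The sub-group named in the text is a subset of the enumerated pairings.
sub_group = {"A-E", "B-F", "C-E"}
print(sub_group.issubset(all_pairs))
```

The listing contains A-D, the example given explicitly, together with the eight further combinations that the text treats as disclosed.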

a) Integrins and their ligands

For proper embryonic development, tissue homeostasis, and wound healing, cell
proliferation must be tightly regulated, both in space and over time. In particular, a cell must be
able to sense its relationship to other cells and the extra-cellular matrix (ECM) and convert these
positional cues into biochemical signals affecting the regulation of proliferation. Because of their
ability to couple the recognition of positional cues to the activation of intracellular signaling
pathways, adhesion receptors, such as integrins and cadherins, are likely to be necessary to achieve
this goal.

The integrins mediate cell adhesion primarily by binding to distinct, although overlapping,
subsets of ECM proteins. Normal cells require contact with serum-derived ECM components for
proliferation, differentiation and survival (e.g. Clark and Brugge, 1995; Lin and Bissell, 1993;
Parise et al., 2000), a phenomenon called anchorage-dependence. This involves signaling through
integrin receptors (Hynes, 1992). Fibronectin and laminin as well as other ECM proteins are
known to act as ligands for integrin receptors (Akiyama et al., 1990). Integrins are transmembrane
proteins forming alpha-beta chain heterodimers. Alpha and beta chain integrins are members of
distinct gene families. The ligand binding specificity of the heterodimers is determined by
specific combinations of alpha and beta chain gene family members (Hynes, 1992). Ligand
binding triggers signaling of integrin receptors through the cytoplasmic tail of the beta chain via
interaction with various signaling components.

Integrins activate common as well as subgroup-specific signaling pathways (Clark and
Brugge, 1995; Giancotti and Ruoslahti, 1999). In particular, while most integrins activate focal
adhesion kinase (FAK), the alpha1beta1, alpha5beta1, alphavbeta3 and alpha6beta4 integrins are
coupled to the Ras-extracellular signal-regulated kinase (ERK) signaling pathway by Shc
(Mainiero et al., 1997; Mainiero et al., 1995; Wary et al., 1996). Shc is an SH2-PTB domain adapter
protein expressed in three forms, p46, p52 and p66, two of which (p46 and p52) link various
tyrosine kinases to Ras by recruiting the Grb2/SOS complex to the plasma membrane (Pawson and
Scott, 1997). Upon activation by SOS, Ras stimulates a kinase cascade culminating in the
activation of the mitogen-activated protein kinase (MAPK) ERK (Marshall, 1995). ERK
phosphorylates ternary complex transcription factors, such as Elk-1 and Sap-1/2, and promotes
transcription of the immediate-early gene Fos (Treisman, 1996). In primary endothelial cells and
keratinocytes, mitogens and Shc-linked integrins cooperate, in a synergistic fashion, to promote
transcription from the Fos promoter. Accordingly, ligation of integrins linked to Shc enables these
cells to progress through G1 in response to mitogens, whereas ligation of other integrins results in
growth arrest, even in the presence of mitogens (Mainiero et al., 1997; Wary et al., 1996). Shc thus
acts like a binary switch controlling cell cycle progression in response to the ECM. Moreover,
integrin receptors have been shown to induce intracellular signaling leading to AKT activation
supporting cell survival (Lee and Juliano, 2000).

In contrast with normal cells, cancer cells generally are able to survive and proliferate in
the absence of anchorage to ECM (Giancotti and Mainiero, 1994), suggesting that tumor cell
survival and proliferation have become independent of the engagement of integrin signaling
through ECM.

Proliferation in the absence of anchorage to ECM of secondary rat embryo fibroblasts
requires the cooperation of Ras and Myc or Ras and adenovirus E1a oncogenes (Land et al., 1983;
Ruley, 1983). Similarly, murine colonic epithelial cells require both activated Ras and mutation of
the adenomatous polyposis coli gene (APCmin) (D'Abaco et al., 1996) in order to proliferate in
suspension.

Integrins are a large family of cell surface receptor molecules that function to mediate
interactions between cells and between cells and the extracellular matrix. Integrin receptors are
heterodimers composed of two subunits, an alpha integrin and a beta integrin. The heterodimer
forms, is expressed on the cell surface, and acts to transmit signals obtained from interactions with
the extracellular matrix or other cells, through the cellular membrane and into the cytosol of the
cell. The signal transduction that takes place occurs because of ligand interactions with the
receptor. Integrin receptors can have a number of ligands, including collagens, fibronectins, and
laminins.

There are currently at least 18 different alpha integrins and at least 8 different beta
integrins that have been shown to form at least 24 different alpha-beta heterodimers. Certain
integrins, such as beta1, interact in a number of different heterodimers, but many subunits only
form a single heterodimer, either because of structural constraints on their interactions, or cellular
expression patterns that provide only a limited number of potential dimer partners. Disclosed
herein are specific relationships that occur between a subset of integrins, integrin receptors, and
their ligands. The disclosed relationships revolve around the alpha6beta4 receptor, formed by the
alpha6 and beta4 integrins. Of particular interest is the relationship between the alpha6beta4
receptor and its ligand, laminin5. The laminins are made up of 3 chains, an alpha chain, a beta
chain, and a gamma chain. The specificity of the interaction between laminin5 and the alpha6beta4
receptor is controlled by the gamma chain. Laminin5 contains a gamma2 chain which only
interacts with the alpha6beta4 integrin receptor.
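
The subunit pairing and ligand specificity described above can be sketched as a small lookup structure. This is only an illustration: it lists the three heterodimers named in this description rather than all 24, and the table and function names are hypothetical.

```python
# Heterodimers named in this description (alpha subunit, beta subunit).
# This is an illustrative subset, not the full set of known receptors.
HETERODIMERS = [
    ("alpha6", "beta4"),
    ("alpha6", "beta1"),
    ("alpha3", "beta1"),
]

# Laminin gamma chain -> receptor it is described as engaging.
GAMMA_CHAIN_RECEPTOR = {"gamma2": ("alpha6", "beta4")}


def partners(subunit):
    """Return the sorted partner subunits listed for an integrin subunit."""
    found = set()
    for alpha, beta in HETERODIMERS:
        if subunit == alpha:
            found.add(beta)
        elif subunit == beta:
            found.add(alpha)
    return sorted(found)


def receptor_for(gamma_chain):
    """Look up the receptor engaged by a laminin gamma chain, or None."""
    return GAMMA_CHAIN_RECEPTOR.get(gamma_chain)


print(partners("alpha6"))      # alpha6 appears in two heterodimers
print(partners("beta4"))       # beta4 appears in one
print(receptor_for("gamma2"))
```

In this sketch alpha6 has two listed partners while beta4 has only one, which mirrors the point that some subunits take part in several heterodimers and others in a single one.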

Integrin alpha6 has seven amino-terminal repeating segments that may fold into a seven
unit beta-propeller, five N-terminal FG-GAP domains and three divalent cation sites. The
transmembrane domain is followed by a short cytoplasmic tail that is alternatively spliced in A
and B forms. The alpha6 integrin chain also shows alternative splicing between repeat units III
and IV, resulting in the presence or absence of Exon X2. Integrin alpha6 is processed into a heavy
and a light chain that are disulphide linked. A representative allele of the human alpha6 cDNA is
set forth in SEQ ID NO:1. It is understood that the disclosed functional domains as well as the
others contained within alpha6 are considered separately disclosed as discrete fragments of the
alpha6 protein as well as the nucleic acid that encodes them.

Integrin beta4 contains a MIDAS-like motif and four cysteine-rich repeats, three EGF-like
domains in the N-terminal extracellular domain, a trans-membrane region and a long cytoplasmic
tail containing two pairs of fibronectin Type III repeats. The latter are connected by a variable
segment that may undergo alternative splicing. Integrin beta4 also undergoes proteolytic
processing in its cytoplasmic tail, causing the 200kD mature form to be converted to 165 and
130kD fragments. A representative allele of the human beta4 cDNA is set forth in SEQ ID NO:5.
It is understood that the disclosed functional domains as well as the others contained within beta4
are considered separately disclosed as discrete fragments of the beta4 protein as well as the nucleic
acid that encodes them.

Laminin5 is composed of the laminin chains alpha3, beta3 and gamma2. Laminin5 can
contain either the shorter laminin alpha3A chain or the longer alpha3B chain. Laminin5 can
also be trimmed by proteolytic processing of the N-terminal portion of its alpha3A chain and the
N-terminal portion of the gamma2 chain. The laminin gamma2 chain contains at least six laminin
EGF-like domains (Domains III and V) with an embedded laminin B domain (Domain IV) within
the N-terminal half. The C-terminal tail contains a coiled-coil domain. The N-terminal processed
portion of the gamma2 chain is sufficient to bind to and activate the integrin alpha6/beta4 receptor.
A representative allele of the human laminin5-gamma2 cDNA is set forth in SEQ ID NO:13. It is
understood that the disclosed functional domains as well as the others contained within the
laminin5 protein and laminin5-gamma2 protein are considered separately disclosed as discrete
fragments of the laminin5 protein and laminin5-gamma2 protein as well as the nucleic acid
that encodes them.

Disclosed are compositions and methods for inhibiting integrin signaling, for example,
integrin signaling dependent on alpha6 and beta4 integrins. For example, compositions and
methods that inhibit integrin receptor signaling from, for example, the alpha6beta4 integrin
receptor are disclosed. It is also understood that the integrin receptor signaling can be affected by,
for example, interfering with a molecule, such as a ligand for the integrin receptor, or a
downstream signaling molecule of the integrin receptor in a way that prevents the integrin receptor
signal from being fully propagated. It is understood that the compositions and methods for
inhibition of integrin signaling and function can be any composition or method that ultimately
inhibits the proliferation of the cell in which the integrin is expressed, by for example killing the
cell. It is understood that the compositions and methods typically can fall into three basic
non-limiting classes of function regulators, which are discussed herein.

(1) Classes 

(a) Production regulators

Production regulators are a broad class of integrin function regulators that are directed at the
production of the target integrin, for example, by preventing mRNA synthesis or expression of the
target integrin, or by causing mRNA degradation of the target integrin, which inhibits the
translation of the target integrin. While production regulators can be any type of molecule
targeting any point in the integrin production pathway, typically these types of compositions will
target either the mRNA expression or the protein expression of the integrin. For example, if beta4
integrin, alpha6 integrin, or the gamma2 subunit of laminin5, which has been shown herein to be
upregulated in cancer cells and which causes the cancer cell to be able to live in the absence of the
ECM, were the target integrin, a typical production regulator of beta4 integrin, alpha6 integrin, or
the gamma2 subunit of laminin5 would be, for example, an antisense molecule that targeted the
mRNA of beta4 integrin, alpha6 integrin, or the gamma2 subunit of laminin5. It is also understood
that a production regulator could also target any molecule that is disclosed herein, or is within the
signaling chain associated with a target integrin or integrin receptor. For example, the inhibition
of the production of the ligand for a target integrin receptor is one way of inhibiting integrin
function of that receptor. Thus, production regulators can either inhibit or enhance integrin
production.
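
As a rough illustration of the antisense approach mentioned above, the following sketch computes the reverse complement of a short mRNA fragment, which is the base sequence an antisense RNA would need in order to pair with it. The fragment shown is an arbitrary placeholder, not an actual beta4 integrin, alpha6 integrin, or laminin5 gamma2 sequence, and the function name is hypothetical.

```python
# Watson-Crick pairing rules for RNA bases.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_rna(mrna: str) -> str:
    """Return the reverse complement of an mRNA fragment written 5'->3'.

    The result is also written 5'->3', so its last base pairs with the
    first base of the target.
    """
    mrna = mrna.upper()
    unknown = set(mrna) - set(COMPLEMENT)
    if unknown:
        raise ValueError(f"unexpected bases: {sorted(unknown)}")
    return "".join(COMPLEMENT[base] for base in reversed(mrna))


if __name__ == "__main__":
    target = "AUGGCAGGGCCA"  # hypothetical placeholder fragment
    print(antisense_rna(target))
```

A practical antisense design would also weigh target accessibility, length, and chemistry, none of which this sketch attempts.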

(b) Integrin to integrin regulators 

Another type of integrin function regulator is an integrin to integrin regulator. This type of
function regulator typically prevents integrins from interacting to form a functional integrin
receptor. For example, an integrin to integrin regulator could be a composition that would interact
with beta4 in a way that would prevent beta4 from interacting with alpha6 to form the alpha6beta4
integrin receptor, or it could be a composition that would interact with alpha6 in a way that would
prevent alpha6 from interacting with beta4 to form the alpha6beta4 integrin receptor. It is also
contemplated that the function regulators of the integrin to integrin interaction can affect the
signaling pathways dependent on integrin receptors containing alpha6 or beta4 integrins. It is not
required that an integrin to integrin regulator prevent an integrin from interacting with all of the
possible integrin partners it could interact with, just that it prevent the interaction of the target
integrin with another specific integrin. For example, the integrin alpha6 interacts with beta1
integrin and beta4 integrin. In certain embodiments, the compositions interfere with alpha6-beta4
interactions but do not interfere with alpha6-beta1 interactions.

(c) Integrin to other molecule regulators

The third class of function regulators is the integrin to other molecule regulators. These
compositions are designed to specifically interfere with molecules such as small molecule ligands
or other proteins that interact with the integrin or integrin receptor. For example, an integrin to
other molecule regulator might target a ligand for a particular integrin receptor, such as the
alpha6beta4 receptor. The ligand for the alpha6beta4 receptor is laminin5 or the laminin gamma2
chain. Compositions that interact with laminin5 such that laminin5 or the laminin gamma2 chain
interactions with alpha6beta4 are inhibited or reduced are specifically contemplated herein.
Likewise, there are other molecules, such as Shc molecules, that also interact with integrins, such
as the alpha6beta4 receptor. Compositions that specifically interact with the Shc molecules such
that they prevent the appropriate interactions between the Shc molecule and the alpha6beta4
receptor are disclosed.

(2) Types 

Just as there are different general classes of molecules that can regulate (such as by
inhibition) the function of the disclosed integrins and integrin receptors, so too there are many
different types of molecules that perform that regulation. For example, any molecule that can
perform the regulation of, for example, the disclosed integrins, integrin receptors, or signaling
pathways produced by the disclosed integrins and integrin receptors is contemplated. For
example, antibodies or small molecules which inhibit the disclosed compositions are herein
disclosed. Also disclosed are, for example, functional nucleic acids, such as ribozymes or
antisense molecules that can inhibit the disclosed integrin function in a variety of ways. A
non-limiting list of exemplary molecules is discussed herein.

(a) Antibodies

Antibodies can be used to regulate, for example, the function of the disclosed integrins and
integrin receptors, molecules that interact with the disclosed integrin receptors, and molecules in
the signaling pathways of the disclosed integrin receptors.

As used herein, the term "antibody" encompasses, but is not limited to, whole
immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually
heterotetrameric glycoproteins, composed of two identical light (L) chains and two identical heavy
(H) chains. Typically, each light chain is linked to a heavy chain by one covalent disulfide bond,
while the number of disulfide linkages varies between the heavy chains of different
immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain
disulfide bridges. Each heavy chain has at one end a variable domain (V(H)) followed by a
number of constant domains. Each light chain has a variable domain at one end (V(L)) and a
constant domain at its other end; the constant domain of the light chain is aligned with the first
constant domain of the heavy chain, and the light chain variable domain is aligned with the
variable domain of the heavy chain. Particular amino acid residues are believed to form an
interface between the light and heavy chain variable domains. The light chains of antibodies from
any vertebrate species can be assigned to one of two clearly distinct types, called kappa (κ) and
lambda (λ), based on the amino acid sequences of their constant domains. Depending on the amino
acid sequence of the constant domain of their heavy chains, immunoglobulins can be assigned to
different classes. There are five major classes of human immunoglobulins: IgA, IgD, IgE, IgG and
IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgG-1, IgG-2,
IgG-3, and IgG-4; IgA-1 and IgA-2. One skilled in the art would recognize the comparable classes
for mouse. The heavy chain constant domains that correspond to the different classes of
immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively.

Antibodies can be either polyclonal or monoclonal. Polyclonal antibodies typically are
derived from the serum of an animal that has been immunogenically challenged, and monoclonal
antibodies are derived as discussed herein.

The term "variable" is used herein to describe certain portions of the variable domains that
differ in sequence among antibodies and are used in the binding and specificity of each particular
antibody for its particular antigen. However, the variability is not usually evenly distributed
through the variable domains of antibodies. It is typically concentrated in three segments called
complementarity determining regions (CDRs) or hypervariable regions both in the light chain and
the heavy chain variable domains. The more highly conserved portions of the variable domains
are called the framework (FR). The variable domains of native heavy and light chains each
comprise four FR regions, largely adopting a β-sheet configuration, connected by three CDRs,
which form loops connecting, and in some cases forming part of, the β-sheet structure. The CDRs
in each chain are held together in close proximity by the FR regions and, with the CDRs from the
other chain, contribute to the formation of the antigen binding site of antibodies (see Kabat E. A.
et al., "Sequences of Proteins of Immunological Interest," National Institutes of Health, Bethesda,
Md. (1987)). The constant domains are not involved directly in binding an antibody to an antigen,
but exhibit various effector functions, such as participation of the antibody in antibody-dependent
cellular toxicity.

As used herein, the term "antibody or fragments thereof" encompasses chimeric antibodies
and hybrid antibodies, with dual or multiple antigen or epitope specificities, and fragments, such as
F(ab')2, Fab', Fab and the like, including hybrid fragments. Thus, fragments of the antibodies that
retain the ability to bind their specific antigens are provided. For example, fragments of antibodies
which maintain EphA2 binding activity are included within the meaning of the term "antibody or
fragment thereof." Such antibodies and fragments can be made by techniques known in the art and
can be screened for specificity and activity according to the methods set forth in the Examples and
in general methods for producing antibodies and screening antibodies for specificity and activity
(See Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New
York, (1988)).

Also included within the meaning of "antibody or fragments thereof" are conjugates of
antibody fragments and antigen binding proteins (single chain antibodies) as described, for
example, in U.S. Pat. No. 4,704,692, the contents of which are hereby incorporated by reference.
Single chain divalent antibodies are also provided.

Optionally, the antibodies are generated in other species and "humanized" for
administration in humans. Humanized forms of non-human (e.g., murine) antibodies are chimeric
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2, or
other antigen-binding subsequences of antibodies) which contain minimal sequence derived from
non-human immunoglobulin. Humanized antibodies include human immunoglobulins (recipient
antibody) in which residues from a complementary determining region (CDR) of the recipient are
replaced by residues from a CDR of a non-human species (donor antibody) such as mouse, rat or
rabbit having the desired specificity, affinity and capacity. In some instances, Fv framework
residues of the human immunoglobulin are replaced by corresponding non-human residues.
Humanized antibodies may also comprise residues that are found neither in the recipient antibody
nor in the imported CDR or framework sequences. In general, the humanized antibody will
comprise substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all
or substantially all of the FR regions are those of a human immunoglobulin consensus sequence.
The humanized antibody optimally also will comprise at least a portion of an immunoglobulin
constant region (Fc), typically that of a human immunoglobulin (Jones et al., Nature, 321:522-525
(1986); Riechmann et al., Nature, 332:323-327 (1988); and Presta, Curr. Op. Struct. Biol.,
2:593-596 (1992)).

Methods for humanizing non-human antibodies are well known in the art. Generally, a
humanized antibody has one or more amino acid residues introduced into it from a source that is
non-human. These non-human amino acid residues are often referred to as "import" residues,
which are typically taken from an "import" variable domain. Humanization can be essentially
performed following the method of Winter and co-workers (Jones et al., Nature, 321:522-525
(1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536
(1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a
human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Pat.
No. 4,816,567), wherein substantially less than an intact human variable domain has been
substituted by the corresponding sequence from a non-human species. In practice, humanized
antibodies are typically human antibodies in which some CDR residues and possibly some FR
residues are substituted by residues from analogous sites in rodent antibodies.

The choice of human variable domains, both light and heavy, to be used in making the
humanized antibodies is very important in order to reduce antigenicity. According to the "best-fit"
method, the sequence of the variable domain of a rodent antibody is screened against the entire
library of known human variable domain sequences. The human sequence which is closest to that
of the rodent is then accepted as the human framework (FR) for the humanized antibody (Sims et
al., J. Immunol., 151:2296 (1993) and Chothia et al., J. Mol. Biol., 196:901 (1987)). Another
method uses a particular framework derived from the consensus sequence of all human antibodies
of a particular subgroup of light or heavy chains. The same framework may be used for several
different humanized antibodies (Carter et al., Proc. Natl. Acad. Sci. USA, 89:4285 (1992);
Presta et al., J. Immunol., 151:2623 (1993)).

It is further important that antibodies be humanized with retention of high affinity for the
antigen and other favorable biological properties. To achieve this goal, according to a preferred
method, humanized antibodies are prepared by a process of analysis of the parental sequences and
various conceptual humanized products using three dimensional models of the parental and
humanized sequences. Three dimensional immunoglobulin models are commonly available and
are familiar to those skilled in the art. Computer programs are available which illustrate and
display probable three-dimensional conformational structures of selected candidate
immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the
residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of residues
that influence the ability of the candidate immunoglobulin to bind its antigen. In this way, FR
residues can be selected and combined from the consensus and import sequence so that the desired
antibody characteristic, such as increased affinity for the target antigen(s), is achieved. In general,
the CDR residues are directly and most substantially involved in influencing antigen binding (see
WO 94/04679, published 3 March 1994).

Transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full
repertoire of human antibodies in the absence of endogenous immunoglobulin production can be
employed. For example, it has been described that the homozygous deletion of the antibody heavy
chain joining region (J(H)) gene in chimeric and germ-line mutant mice results in complete
inhibition of endogenous antibody production. Transfer of the human germ-line immunoglobulin
gene array in such germ-line mutant mice will result in the production of human antibodies upon
antigen challenge (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA, 90:2551-2555 (1993);
Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in Immuno., 7:33 (1993)).
Human antibodies can also be produced in phage display libraries (Hoogenboom et al., J. Mol.
Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 222:581 (1991)). The techniques of Cole et al.
and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et
al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J.
Immunol., 147(1):86-95 (1991)).

The present invention further provides a hybridoma cell that produces the monoclonal
antibody of the invention. The term "monoclonal antibody" as used herein refers to an antibody
obtained from a substantially homogeneous population of antibodies, i.e., the individual antibodies
comprising the population are identical except for possible naturally occurring mutations that may
be present in minor amounts. The monoclonal antibodies herein specifically include "chimeric"
antibodies in which a portion of the heavy and/or light chain is identical with or homologous to
corresponding sequences in antibodies derived from a particular species or belonging to a
particular antibody class or subclass, while the remainder of the chain(s) is identical with or
homologous to corresponding sequences in antibodies derived from another species or belonging
to another antibody class or subclass, as well as fragments of such antibodies, so long as they
exhibit the desired activity (See U.S. Pat. No. 4,816,567 and Morrison et al., Proc. Natl. Acad.
Sci. USA, 81:6851-6855 (1984)).

Monoclonal antibodies of the invention may be prepared using hybridoma methods, such
as those described by Kohler and Milstein, Nature, 256:495 (1975) or Harlow and Lane.
Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988). In a
hybridoma method, a mouse or other appropriate host animal is typically immunized with an
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that
will specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized
in vitro. Preferably, the immunizing agent comprises EphA2. Traditionally, the generation of
monoclonal antibodies has depended on the availability of purified protein or peptides for use as
the immunogen. More recently, DNA-based immunizations have shown promise as a way to elicit
strong immune responses and generate monoclonal antibodies. In this approach, DNA-based
immunization can be used, wherein DNA encoding a portion of EphA2 expressed as a fusion
protein with human IgG1 is injected into the host animal according to methods known in the art
(e.g., Kilpatrick KE, et al. Gene gun delivered DNA-based immunizations mediate rapid
production of murine monoclonal antibodies to the Flt-3 receptor. Hybridoma. 1998
Dec;17(6):569-76; Kilpatrick KE et al. High-affinity monoclonal antibodies to PED/PEA-15
generated using 5 microg of DNA. Hybridoma. 2000 Aug;19(4):297-302, which are incorporated
herein by reference in full for the methods of antibody production).

An alternate approach to immunizations with either purified protein or DNA is to use
antigen expressed in baculovirus. The advantages of this system include ease of generation, high
levels of expression, and post-translational modifications that are highly similar to those seen in
mammalian systems. Use of this system involves expressing domains of EphA2 antibody as
fusion proteins. The antigen is produced by inserting a gene fragment in-frame between the signal
sequence and the mature protein domain of the EphA2 antibody nucleotide sequence. This results
in the display of the foreign proteins on the surface of the virion. This method allows
immunization with whole virus, eliminating the need for purification of target antigens.

Generally, peripheral blood lymphocytes ("PBLs") are used in methods of producing
monoclonal antibodies if cells of human origin are desired, or spleen cells or lymph node cells are
used if non-human mammalian sources are desired. The lymphocytes are then fused with an
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
hybridoma cell (Goding, "Monoclonal Antibodies: Principles and Practice" Academic Press,
(1986) pp. 59-103). Immortalized cell lines are usually transformed mammalian cells, including
myeloma cells of rodent, bovine, equine, and human origin. Usually, rat or mouse myeloma cell
lines are employed. The hybridoma cells may be cultured in a suitable culture medium that
preferably contains one or more substances that inhibit the growth or survival of the unfused,
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically
will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances
prevent the growth of HGPRT-deficient cells. Preferred immortalized cell lines are those that fuse
efficiently, support stable high level expression of antibody by the selected antibody-producing
cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines
are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell
Distribution Center, San Diego, Calif., and the American Type Culture Collection, Rockville, Md.
Human myeloma and mouse-human heteromyeloma cell lines also have been described for the
production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et
al., "Monoclonal Antibody Production Techniques and Applications" Marcel Dekker, Inc., New
York, (1987) pp. 51-63).

The culture medium in which the hybridoma cells are cultured can then be assayed for the
presence of monoclonal antibodies directed against EphA2. Preferably, the binding specificity of
monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation or
by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent
assay (ELISA). Such techniques and assays are known in the art, and are described further in the
Examples below or in Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor
Publications, New York, (1988).

After the desired hybridoma cells are identified, the clones may be subcloned by limiting
dilution or FACS sorting procedures and grown by standard methods. Suitable culture media for
this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium.
Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal.
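
The reasoning behind limiting dilution can be illustrated with a short calculation. The sketch below assumes, as a common simplification that is not stated in this disclosure, that the number of cells landing in each well follows a Poisson distribution; the function name and the seeding densities are illustrative only.

```python
import math


def clonal_fraction(mean_cells_per_well: float) -> float:
    """Fraction of wells showing growth that were seeded with exactly one cell.

    Assumes a Poisson distribution of cells per well (an assumption of this
    sketch) and that every seeded cell grows.
    """
    lam = mean_cells_per_well
    if lam <= 0:
        raise ValueError("mean_cells_per_well must be positive")
    p_zero = math.exp(-lam)        # wells with no cells
    p_one = lam * math.exp(-lam)   # wells with exactly one cell
    return p_one / (1.0 - p_zero)


if __name__ == "__main__":
    for density in (1.0, 0.5, 0.3):  # hypothetical seeding densities
        print(density, round(clonal_fraction(density), 3))
```

Under this assumption, lowering the average number of cells per well raises the chance that a growing well derives from a single cell, at the cost of more empty wells.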

The monoclonal antibodies secreted by the subclones may be isolated or purified from the
culture medium or ascites fluid by conventional immunoglobulin purification procedures such as,
for example, protein A-Sepharose, protein G, hydroxylapatite chromatography, gel electrophoresis,
dialysis, or affinity chromatography.

The monoclonal antibodies may also be made by recombinant DNA methods, such as
those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies of the
invention can be readily isolated and sequenced using conventional procedures (e.g., by using
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred
source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells,
plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also may
be modified, for example, by substituting the coding sequence for human heavy and light chain
constant domains in place of the homologous murine sequences (U.S. Pat. No. 4,816,567) or by
covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a
non-immunoglobulin polypeptide. Optionally, such a non-immunoglobulin polypeptide is
substituted for the constant domains of an antibody of the invention or substituted for the variable
domains of one antigen-combining site of an antibody of the invention to create a chimeric
bivalent antibody comprising one antigen-combining site having specificity for EphA2 and another
antigen-combining site having specificity for a different antigen.

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of
antibodies to produce fragments thereof, particularly Fab fragments, can be accomplished using
routine techniques known in the art. For instance, digestion can be performed using papain.
Examples of papain digestion are described in WO 94/29348 published Dec. 22, 1994, U.S. Pat.
No. 4,342,566, and Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor
Publications, New York, (1988). Papain digestion of antibodies typically produces two identical
antigen binding fragments, called Fab fragments, each with a single antigen binding site, and a
residual Fc fragment. Pepsin treatment yields a fragment, called the F(ab')2 fragment, that has
two antigen combining sites and is still capable of cross-linking antigen.

The Fab fragments produced in the antibody digestion also contain the constant domains
of the light chain and the first constant domain of the heavy chain. Fab' fragments differ from Fab
fragments by the addition of a few residues at the carboxy terminus of the heavy chain domain
including one or more cysteines from the antibody hinge region. The F(ab')2 fragment is a
bivalent fragment comprising two Fab' fragments linked by a disulfide bridge at the hinge region.
Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains
bear a free thiol group. Antibody fragments originally were produced as pairs of Fab' fragments
which have hinge cysteines between them. Other chemical couplings of antibody fragments are
also known.

An isolated immunogenically specific paratope or fragment of the antibody is also
provided. A specific immunogenic epitope of the antibody can be isolated from the whole
antibody by chemical or mechanical disruption of the molecule. The purified fragments thus
obtained are tested to determine their immunogenicity and specificity by the methods taught
herein. Immunoreactive paratopes of the antibody, optionally, are synthesized directly. An
immunoreactive fragment is defined as an amino acid sequence of at least about two to five
consecutive amino acids derived from the antibody amino acid sequence.

One method of producing proteins comprising the antibodies of the present invention is to
link two or more peptides or polypeptides together by protein chemistry techniques. For example,
peptides or polypeptides can be chemically synthesized using currently available laboratory
equipment using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert-butyloxycarbonyl)
chemistry (Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily
appreciate that a peptide or polypeptide corresponding to the antibody of the present invention, for
example, can be synthesized by standard chemical reactions. For example, a peptide or
polypeptide can be synthesized and not cleaved from its synthesis resin whereas the other fragment
of an antibody can be synthesized and subsequently cleaved from the resin, thereby exposing a
terminal group which is functionally blocked on the other fragment. By peptide condensation
reactions, these two fragments can be covalently joined via a peptide bond at their carboxyl and
amino termini, respectively, to form an antibody, or fragment thereof (Grant GA (1992) Synthetic
Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed.
(1993) Principles of Peptide Synthesis. Springer-Verlag Inc., NY). Alternatively, the peptide or
polypeptide is independently synthesized in vivo as described above. Once isolated, these
independent peptides or polypeptides may be linked to form an antibody or fragment thereof via
similar peptide condensation reactions.

For example, enzymatic ligation of cloned or synthetic peptide segments allows relatively
short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole
protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, native
chemical ligation of synthetic peptides can be utilized to synthetically construct large peptides or
polypeptides from shorter peptide fragments. This method consists of a two step chemical reaction
(Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)).
The first step is the chemoselective reaction of an unprotected synthetic peptide-alpha-thioester
with another unprotected peptide segment containing an amino-terminal Cys residue to give a
thioester-linked intermediate as the initial covalent product. Without a change in the reaction
conditions, this intermediate undergoes spontaneous, rapid intramolecular reaction to form a native
peptide bond at the ligation site. Application of this native chemical ligation method to the total
synthesis of a protein molecule is illustrated by the preparation of human interleukin 8 (IL-8)
(Baggiolini M et al. (1992) FEBS Lett. 307:97-101; Clark-Lewis I et al., J. Biol. Chem., 269:16075
(1994); Clark-Lewis I et al., Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry
33:6623-30 (1994)).

Alternatively, unprotected peptide segments are chemically linked where the bond formed
between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide)
bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been used to synthesize
analogs of protein domains as well as large amounts of relatively pure proteins with full biological
activity (deLisle Milton RC et al., Techniques in Protein Chemistry IV. Academic Press, New
York, pp. 257-267 (1992)).

The invention also provides fragments of antibodies which have bioactivity. The
polypeptide fragments of the present invention can be recombinant proteins obtained by cloning
nucleic acids encoding the polypeptide in an expression system capable of producing the
polypeptide fragments thereof, such as an adenovirus or baculovirus expression system. For
example, one can determine the active domain of an antibody from a specific hybridoma that can
cause a biological effect associated with the interaction of the antibody with EphA2. For example,
amino acids found to not contribute to either the activity or the binding specificity or affinity of the
antibody can be deleted without a loss in the respective activity. For example, in various
embodiments, amino- or carboxy-terminal amino acids are sequentially removed from either the
native or the modified non-immunoglobulin molecule or the immunoglobulin molecule and the
respective activity assayed in one of many available assays. In another example, a fragment of an
antibody comprises a modified antibody wherein at least one amino acid has been substituted for
the naturally occurring amino acid at a specific position, and a portion of either amino terminal or
carboxy terminal amino acids, or even an internal region of the antibody, has been replaced with a
polypeptide fragment or other moiety, such as biotin, which can facilitate the purification of the
modified antibody. For example, a modified antibody can be fused to a maltose binding protein,
through either peptide chemistry or cloning the respective nucleic acids encoding the two
polypeptide fragments into an expression vector such that the expression of the coding region
results in a hybrid polypeptide. The hybrid polypeptide can be affinity purified by passing it over
an amylose affinity solid support, and the modified antibody receptor can then be separated from
the maltose binding region by cleaving the hybrid polypeptide with the specific protease factor Xa.
(See, for example, New England Biolabs Product Catalog, 1996, pg. 164.) Similar purification
procedures are available for isolating hybrid proteins from eukaryotic cells as well.
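
The sequential removal of terminal amino acids described above can be pictured as enumerating a deletion series, each member of which would then be expressed and assayed. The sketch below only generates the candidate sequences; the peptide shown is an arbitrary placeholder, and the naming scheme and function are hypothetical.

```python
def terminal_truncations(seq: str, max_removed: int) -> list:
    """List variants with 1..max_removed residues removed from either terminus.

    Each entry is (label, sequence). The loop stops before a variant
    would be empty.
    """
    variants = []
    for k in range(1, max_removed + 1):
        if k >= len(seq):
            break
        variants.append((f"N-del{k}", seq[k:]))   # k residues off the amino terminus
        variants.append((f"C-del{k}", seq[:-k]))  # k residues off the carboxy terminus
    return variants


if __name__ == "__main__":
    for label, variant in terminal_truncations("ACDEFG", 2):  # placeholder peptide
        print(label, variant)
```

Whether a given variant retains activity is an experimental question that the enumeration itself cannot answer.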

30 The fragments, whether attached to other sequences or not, include insertions, deletions, 

substitutions, or other selected modifications of particular regions or specific amino acids residues, 
provided the activity of the fragment is not significantly altered or impaired compared to the 
nonmodified antibody or antibody fragment, These modifications can provide for some additional 
property, such as to remove or add amino acids capable of disulfide bonding, to increase its bio- 

35 longevity, to alter its secretory characteristics, etc. In any case, the fragment must possess a 
bioactive property, such as binding activity, regulation of binding at the binding domain, etc. 
Functional or active regions of the antibody may be identified by mutagenesis of a specific region
of the protein, followed by expression and testing of the expressed polypeptide. Such methods are
readily apparent to a skilled practitioner in the art and can include site-specific mutagenesis of the
nucleic acid encoding the antigen. (Zoller MJ et al., Nucl. Acids Res. 10:6487-500 (1982).)

A variety of immunoassay formats may be used to select antibodies that selectively bind
with a particular protein, variant, or fragment. For example, solid-phase ELISA immunoassays are
routinely used to select antibodies selectively immunoreactive with a protein, protein variant, or
fragment thereof. See Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor
Publications, New York (1988), for a description of immunoassay formats and conditions that
could be used to determine selective binding. The binding affinity of a monoclonal antibody can,
for example, be determined by the Scatchard analysis of Munson et al., Anal. Biochem., 107:220
(1980).
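
As an illustration of the Scatchard analysis mentioned above, the following is a minimal sketch, not the procedure of Munson et al. It assumes a single class of non-interacting binding sites, for which bound/free = (Bmax - bound) / k_d, so a straight-line fit of bound/free against bound has slope -1/k_d. The function names, the synthetic concentrations, and the "true" k_d and Bmax values are hypothetical and chosen only for demonstration.

```python
def scatchard_fit(bound, free):
    """Least-squares line through (bound, bound/free); returns (kd, bmax).

    Assumes one class of independent sites: B/F = (Bmax - B) / Kd.
    """
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx                      # slope = -1 / Kd
    intercept = mean_y - slope * mean_x    # intercept = Bmax / Kd
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


def simulate_binding(kd, bmax, free_concs):
    """Ideal single-site binding isotherm, used here only to make test data."""
    return [bmax * f / (kd + f) for f in free_concs]


# Hypothetical values (molar), for illustration only.
TRUE_KD = 2e-9
TRUE_BMAX = 1e-10
free = [0.5e-9, 1e-9, 2e-9, 4e-9, 8e-9]
bound = simulate_binding(TRUE_KD, TRUE_BMAX, free)

kd_est, bmax_est = scatchard_fit(bound, free)
print(f"estimated Kd = {kd_est:.3e} M, Bmax = {bmax_est:.3e} M")
```

With real data the Scatchard plot is often curved (multiple site classes, nonspecific binding), in which case a direct nonlinear fit of the binding isotherm is generally a better choice than this linearization.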

Also provided is an antibody reagent kit comprising containers of the monoclonal antibody
or fragment thereof of the invention and one or more reagents for detecting binding of the antibody
or fragment thereof to the EphA2 receptor molecule. The reagents can include, for example,
fluorescent tags, enzymatic tags, or other tags. The reagents can also include secondary or tertiary
antibodies or reagents for enzymatic reactions, wherein the enzymatic reactions produce a product
that can be visualized.

(b) Functional Nucleic Acids 
Functional nucleic acids can also be used to regulate, for example, the function of the
disclosed integrins, integrin receptors, molecules that interact with the disclosed integrin receptors,
and molecules in the signaling pathways of the disclosed integrin receptors.

Functional nucleic acids are nucleic acid molecules that have a specific function, such as
binding a target molecule or catalyzing a specific reaction. Functional nucleic acid molecules can
be divided into the following categories, which are not meant to be limiting. For example,
functional nucleic acids include antisense molecules, aptamers, ribozymes, triplex forming
molecules, and external guide sequences. The functional nucleic acid molecules can act as
effectors, inhibitors, modulators, and stimulators of a specific activity possessed by a target
molecule, or the functional nucleic acid molecules can possess a de novo activity independent of
any other molecules.

Functional nucleic acid molecules can interact with any macromolecule, such as DNA,
RNA, polypeptides, or carbohydrate chains. Thus, functional nucleic acids can interact with the
mRNA of beta4 integrin, for example, or the genomic DNA of alpha6 integrin, for example, or they
can interact with the polypeptide laminin5, or the gamma2 subunit of laminin5. Often functional
nucleic acids are designed to interact with other nucleic acids based on sequence homology
between the target molecule and the functional nucleic acid molecule. In other situations, the
specific recognition between the functional nucleic acid molecule and the target molecule is not
based on sequence homology between the functional nucleic acid molecule and the target
molecule, but rather is based on the formation of tertiary structure that allows specific recognition
to take place. Antisense molecules are designed to interact with a target nucleic acid molecule
through either canonical or non-canonical base pairing. The interaction of the antisense molecule
and the target molecule is designed to promote the destruction of the target molecule through, for
example, RNase H mediated RNA-DNA hybrid degradation. Alternatively the antisense molecule
is designed to interrupt a processing function that normally would take place on the target
molecule, such as transcription or replication. Antisense molecules can be designed based on the
sequence of the target molecule. Numerous methods for optimization of antisense efficiency by
finding the most accessible regions of the target molecule exist. Exemplary methods would be in
vitro selection experiments and DNA modification studies using DMS and DEPC. It is preferred
that antisense molecules bind the target molecule with a dissociation constant (k_d) less than 10^-6. It
is more preferred that antisense molecules bind with a k_d less than 10^-8. It is also more preferred
that the antisense molecules bind the target molecule with a k_d less than 10^-10. It is also preferred
that the antisense molecules bind the target molecule with a k_d less than 10^-12. A representative
sample of methods and techniques which aid in the design and use of antisense molecules can be
found in the following non-limiting list of United States patents: 5,135,917, 5,294,533, 5,627,158,
5,641,754, 5,691,317, 5,780,607, 5,786,138, 5,849,903, 5,856,103, 5,919,772, 5,955,590,
5,990,088, 5,994,320, 5,998,602, 6,005,095, 6,007,995, 6,013,522, 6,017,898, 6,018,042,
6,025,198, 6,033,910, 6,040,296, 6,046,004, 6,046,319, and 6,057,437.
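
Designing an antisense molecule from the sequence of its target, in the simplest canonical base-pairing case, amounts to taking the reverse complement of the chosen target region. The sketch below illustrates only that step. The function name and the example sequence are hypothetical (the sequence is not taken from any SEQ ID NO disclosed herein), and the sketch makes no attempt to pick accessible regions or to account for nucleotide analogs.

```python
# Watson-Crick partners for an RNA target paired with a DNA antisense oligo.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def antisense_dna(target_rna):
    """Return the DNA oligo (written 5'->3') that is fully complementary
    to the given RNA target region (also written 5'->3')."""
    target = target_rna.upper()
    unknown = set(target) - set(COMPLEMENT)
    if unknown:
        raise ValueError(f"unexpected bases in RNA target: {sorted(unknown)}")
    # Reverse the target, then complement each base.
    return "".join(COMPLEMENT[base] for base in reversed(target))


# Hypothetical 15-nucleotide target region, for illustration only.
example_target = "AUGGCAGGGCCACGC"
print(antisense_dna(example_target))
```

In practice candidate oligos produced this way would still need to be screened, for example by the accessibility methods mentioned above, since a fully complementary sequence is not necessarily an effective one.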

Aptamers are molecules that interact with a target molecule, preferably in a specific way.
Typically aptamers are small nucleic acids ranging from 15-50 bases in length that fold into
defined secondary and tertiary structures, such as stem-loops or G-quartets. Aptamers can bind
small molecules, such as ATP (United States patent 5,631,146) and theophylline (United States
patent 5,580,737), as well as large molecules, such as reverse transcriptase (United States patent
5,786,462) and thrombin (United States patent 5,543,293). Aptamers can bind very tightly with
k_d values from the target molecule of less than 10^-12 M. It is preferred that the aptamers bind the target
molecule with a k_d less than 10^-6. It is more preferred that the aptamers bind the target molecule
with a k_d less than 10^-8. It is also more preferred that the aptamers bind the target molecule with a
k_d less than 10^-10. It is also preferred that the aptamers bind the target molecule with a k_d less than
10^-12. Aptamers can bind the target molecule with a very high degree of specificity. For example,
aptamers have been isolated that have greater than a 10000 fold difference in binding affinities
between the target molecule and another molecule that differ at only a single position on the
molecule (United States patent 5,543,293). It is preferred that the aptamer have a k_d with the target
molecule at least 10 fold lower than the k_d with a background binding molecule. It is more
preferred that the aptamer have a k_d with the target molecule at least 100 fold lower than the k_d
with a background binding molecule. It is more preferred that the aptamer have a k_d with the
target molecule at least 1000 fold lower than the k_d with a background binding molecule. It is
preferred that the aptamer have a k_d with the target molecule at least 10000 fold lower than the k_d
with a background binding molecule. It is preferred when doing the comparison for a polypeptide,
for example, that the background molecule be a different polypeptide. For example, when
determining the specificity of beta4 integrin aptamers, the background protein could be bovine
serum albumin. Representative examples of how to make and use aptamers to bind a variety of
different target molecules can be found in the following non-limiting list of United States patents:
5,476,766, 5,503,978, 5,631,146, 5,731,424, 5,780,228, 5,792,613, 5,795,721, 5,846,713,
5,858,660, 5,861,254, 5,864,026, 5,869,641, 5,958,691, 6,001,988, 6,011,020, 6,013,443,
6,020,130, 6,028,186, 6,030,776, and 6,051,698.
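
The specificity comparison described above reduces to a ratio of two dissociation constants. The following sketch computes that fold difference and reports the highest of the stated thresholds (10, 100, 1000, 10000 fold) that a given aptamer meets. The function names and the numerical k_d values are hypothetical and serve only to illustrate the arithmetic.

```python
THRESHOLDS = (10000, 1000, 100, 10)  # fold differences named in the text


def specificity_fold(kd_target, kd_background):
    """How many fold lower the target Kd is than the background Kd."""
    if kd_target <= 0 or kd_background <= 0:
        raise ValueError("dissociation constants must be positive")
    return kd_background / kd_target


def highest_threshold_met(fold):
    """Largest listed threshold that the fold difference reaches, or None."""
    for threshold in THRESHOLDS:
        if fold >= threshold:
            return threshold
    return None


# Hypothetical measurements (molar): target versus a background protein
# such as bovine serum albumin.
fold = specificity_fold(kd_target=2e-9, kd_background=5e-5)
print(f"{fold:.0f} fold, meets threshold: {highest_threshold_met(fold)}")
```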

Ribozymes are nucleic acid molecules that are capable of catalyzing a chemical reaction,
either intramolecularly or intermolecularly. Ribozymes are thus catalytic nucleic acids. It is
preferred that the ribozymes catalyze intermolecular reactions. There are a number of different
types of ribozymes that catalyze nuclease or nucleic acid polymerase type reactions which are
based on ribozymes found in natural systems, such as hammerhead ribozymes (for example, but
not limited to the following United States patents: 5,334,711, 5,436,330, 5,616,466, 5,633,133,
5,646,020, 5,652,094, 5,712,384, 5,770,715, 5,856,463, 5,861,288, 5,891,683, 5,891,684,
5,985,621, 5,989,908, 5,998,193, 5,998,203, WO 9858058 by Ludwig and Sproat, WO 9858057 by
Ludwig and Sproat, and WO 9718312 by Ludwig and Sproat), hairpin ribozymes (for example, but
not limited to the following United States patents: 5,631,115, 5,646,031, 5,683,902, 5,712,384,
5,856,188, 5,866,701, 5,869,339, and 6,022,962), and Tetrahymena ribozymes (for example, but not
limited to the following United States patents: 5,595,873 and 5,652,107). There are also a number
of ribozymes that are not found in natural systems, but which have been engineered to catalyze
specific reactions de novo (for example, but not limited to the following United States patents:
5,580,967, 5,688,670, 5,807,718, and 5,910,408). Preferred ribozymes cleave RNA or DNA
substrates, and more preferably cleave RNA substrates. Ribozymes typically cleave nucleic acid
substrates through recognition and binding of the target substrate with subsequent cleavage. This
recognition is often based mostly on canonical or non-canonical base pair interactions. This
property makes ribozymes particularly good candidates for target specific cleavage of nucleic
acids because recognition of the target substrate is based on the target substrate's sequence.
Representative examples of how to make and use ribozymes to catalyze a variety of different
reactions can be found in the following non-limiting list of United States patents: 5,646,042,
5,693,535, 5,731,295, 5,811,300, 5,837,855, 5,869,253, 5,877,021, 5,877,022, 5,972,699,
5,972,704, 5,989,906, and 6,017,756.

WO 02/30465

PCT/US01/32127

Triplex forming functional nucleic acid molecules are molecules that can interact with
either double-stranded or single-stranded nucleic acid. When triplex molecules interact with a
target region, a structure called a triplex is formed, in which there are three strands of DNA
forming a complex dependent on both Watson-Crick and Hoogsteen base-pairing. Triplex
molecules are preferred because they can bind target regions with high affinity and specificity. It
is preferred that the triplex forming molecules bind the target molecule with a k_d less than 10^-6. It
is more preferred that the triplex forming molecules bind with a k_d less than 10^-8. It is also more
preferred that the triplex forming molecules bind the target molecule with a k_d less than 10^-10. It is
also preferred that the triplex forming molecules bind the target molecule with a k_d less than 10^-12.
Representative examples of how to make and use triplex forming molecules to bind a variety of
different target molecules can be found in the following non-limiting list of United States patents:
5,176,996, 5,645,985, 5,650,316, 5,683,874, 5,693,773, 5,834,185, 5,869,246, 5,874,566, and
5,962,426.

External guide sequences (EGSs) are molecules that bind a target nucleic acid molecule
forming a complex, and this complex is recognized by RNase P, which cleaves the target
molecule. EGSs can be designed to specifically target an RNA molecule of choice. RNase P aids
in processing transfer RNA (tRNA) within a cell. Bacterial RNase P can be recruited to cleave
virtually any RNA sequence by using an EGS that causes the target RNA:EGS complex to mimic
the natural tRNA substrate. (WO 92/03566 by Yale, and Forster and Altman, Science 238:407-409
(1990)).

Similarly, eukaryotic EGS/RNase P-directed cleavage of RNA can be utilized to cleave
desired targets within eukaryotic cells. (Yuan et al., Proc. Natl. Acad. Sci. USA 89:8006-8010
(1992); WO 93/22434 by Yale; WO 95/24489 by Yale; Yuan and Altman, EMBO J 14:159-168
(1995); and Carrara et al., Proc. Natl. Acad. Sci. (USA) 92:2627-2631 (1995)). Representative
examples of how to make and use EGS molecules to facilitate cleavage of a variety of different
target molecules can be found in the following non-limiting list of United States patents: 5,168,053,
5,624,824, 5,683,873, 5,728,521, 5,869,248, and 5,877,162.

(c) Small molecules 

Small molecules can also be used to regulate, for example, the function of the disclosed
integrins, integrin receptors, molecules that interact with the disclosed integrin receptors, and
molecules in the signaling pathways of the disclosed integrin receptors. Those of skill in the art
understand how to generate small molecules of this type, and exemplary libraries and methods for
isolating small molecule regulators are disclosed herein.
b) Compositions identified by screening with disclosed
compositions / combinatorial chemistry 

(1) Combinatorial chemistry 

The disclosed compositions can be used as targets for any combinatorial technique to
identify molecules or macromolecular molecules that interact with the disclosed compositions,
such as beta4 integrin, alpha6 integrin, or the gamma2 subunit of laminin5, in a desired way. The
nucleic acids, peptides, and related molecules disclosed herein can be used as targets for the
combinatorial approaches. Also disclosed are the compositions that are identified through
combinatorial techniques or screening techniques in which the herein disclosed compositions, for
example set forth in SEQ ID NOS: 1-19 or portions thereof, are used as the target or reagent in a
combinatorial or screening protocol.

It is understood that when using the disclosed compositions in combinatorial techniques or
screening methods, molecules, such as macromolecular molecules, will be identified that have
particular desired properties such as inhibition or stimulation of the target molecule's function.
The molecules identified and isolated when using the disclosed compositions, such as beta4
integrin, alpha6 integrin, or the gamma2 subunit of laminin5, are also disclosed. Thus, the products
produced using the combinatorial or screening approaches that involve the disclosed compositions,
such as beta4 integrin, alpha6 integrin, or the gamma2 subunit of laminin5, are also considered
herein disclosed.

Combinatorial chemistry includes but is not limited to all methods for isolating small
molecules or macromolecules that are capable of binding either a small molecule or another
macromolecule, typically in an iterative process. Proteins, oligonucleotides, and sugars are
examples of macromolecules. For example, oligonucleotide molecules with a given function,
catalytic or ligand-binding, can be isolated from a complex mixture of random oligonucleotides in
what has been referred to as "in vitro genetics" (Szostak, TIBS 19:89, 1992). One synthesizes a
large pool of molecules bearing random and defined sequences and subjects that complex mixture,
for example, approximately 10^15 individual sequences in 100 µg of a 100 nucleotide RNA, to some
selection and enrichment process. Through repeated cycles of affinity chromatography and PCR
amplification of the molecules bound to the ligand on the solid support, Ellington and Szostak
(1990) estimated that 1 in 10^10 RNA molecules folded in such a way as to bind small molecule
dyes. DNA molecules with such ligand-binding behavior have been isolated as well (Ellington and
Szostak, 1992; Bock et al., 1992). Techniques aimed at similar goals exist for small organic
molecules, proteins, antibodies and other macromolecules known to those of skill in the art.
Screening sets of molecules for a desired activity, whether based on small organic libraries,
oligonucleotides, or antibodies, is broadly referred to as combinatorial chemistry. Combinatorial
techniques are particularly suited for defining binding interactions between molecules and for
isolating molecules that have a specific binding activity, often called aptamers when the
macromolecules are nucleic acids.
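
The pool size quoted above (approximately 10^15 sequences in 100 µg of a 100 nucleotide RNA) can be checked with a short calculation. The sketch below is a rough estimate only: the average mass of about 330 g/mol per RNA nucleotide is an assumed round figure, not a value taken from this disclosure, and the function name is hypothetical.

```python
AVOGADRO = 6.02214076e23          # molecules per mole
AVG_NT_MASS_G_PER_MOL = 330.0     # assumed average mass of one RNA nucleotide


def molecules_in_pool(mass_g, length_nt, nt_mass=AVG_NT_MASS_G_PER_MOL):
    """Approximate number of RNA molecules in a pool of the given mass."""
    molar_mass = length_nt * nt_mass          # g/mol for the whole strand
    return mass_g / molar_mass * AVOGADRO


pool = molecules_in_pool(mass_g=100e-6, length_nt=100)
sequence_space = 4 ** 100                     # all possible 100-mers

print(f"molecules in pool     : {pool:.2e}")
print(f"possible 100-mers     : {sequence_space:.2e}")
print(f"fraction of space seen: {pool / sequence_space:.2e}")
```

Under these assumptions the pool holds on the order of 10^15 molecules, consistent with the figure in the text, while covering only a vanishing fraction of all possible 100 nucleotide sequences. This is why selection and enrichment, rather than exhaustive testing, is used.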

There are a number of methods for isolating proteins which either have de novo activity or
a modified activity. For example, phage display libraries have been used to isolate numerous
peptides that interact with a specific target. (See for example, United States Patent No. 6,031,071;
5,824,520; 5,596,079; and 5,565,332 which are herein incorporated by reference at least for their
material related to phage display and methods related to combinatorial chemistry.)

A preferred method for isolating proteins that have a given function is described by
Roberts and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci. USA,
94(23):12997-302 (1997)). This combinatorial chemistry method couples the functional power of
proteins and the genetic power of nucleic acids. An RNA molecule is generated in which a
puromycin molecule is covalently attached to the 3'-end of the RNA molecule. An in vitro
translation of this modified RNA molecule causes the correct protein, encoded by the RNA, to be
translated. In addition, because of the attachment of the puromycin, a peptidyl acceptor which
cannot be extended, the growing peptide chain is attached to the puromycin which is attached to
the RNA. Thus, the protein molecule is attached to the genetic material that encodes it. Normal in
vitro selection procedures can now be done to isolate functional peptides. Once the selection
procedure for peptide function is complete, traditional nucleic acid manipulation procedures are
performed to amplify the nucleic acid that codes for the selected functional peptides. After
amplification of the genetic material, new RNA is transcribed with puromycin at the 3'-end, new
peptide is translated and another functional round of selection is performed. Thus, protein
selection can be performed in an iterative manner just like nucleic acid selection techniques. The
peptide which is translated is controlled by the sequence of the RNA attached to the puromycin.
This sequence can be anything from a random sequence engineered for optimum translation (i.e.,
no stop codons etc.) to a degenerate sequence of a known RNA molecule, used to look for
improved or altered function of a known peptide. The conditions for nucleic acid amplification
and in vitro translation are well known to those of ordinary skill in the art and are preferably
performed as in Roberts and Szostak (Roberts R.W. and Szostak J.W. Proc. Natl. Acad. Sci.
USA, 94(23):12997-302 (1997)).

Another preferred method for combinatorial methods designed to isolate peptides is
described in Cohen et al. (Cohen B.A., et al., Proc. Natl. Acad. Sci. USA 95(24):14272-7
(1998)). This method utilizes and modifies two-hybrid technology. Yeast two-hybrid systems are
useful for the detection and analysis of protein:protein interactions. The two-hybrid system,
initially described in the yeast Saccharomyces cerevisiae, is a powerful molecular genetic
technique for identifying new regulatory molecules, specific to the protein of interest (Fields and
Song, Nature 340:245-6 (1989)). Cohen et al. modified this technology so that novel interactions
between synthetic or engineered peptide sequences could be identified which bind a molecule of
choice. The benefit of this type of technology is that the selection is done in an intracellular
environment. The method utilizes a library of peptide molecules that are attached to an acidic
activation domain. A peptide of choice, for example a portion of beta4 or alpha6 or gamma2 or
laminin5, is attached to a DNA binding domain of a transcriptional activation protein, such as
Gal4. By performing the two-hybrid technique on this type of system, molecules that bind the
portion of beta4 or alpha6 or gamma2 or laminin5 can be identified.

There are molecules that can act like antibodies, in that they can have varying binding
specificities, that are based on a fibronectin motif. The fibronectin type III domain (FN3) is a
small autonomous folding unit. This FN3 domain can be found in numerous proteins that bind
ligand, such as animal proteins. The beta-sandwich structure of FN3 closely resembles that of
immunoglobulin domains. FN3 mutants that bind desired targets can be isolated using
combinatorial approaches disclosed herein, for example phage display. Typically the libraries of FN3
molecules have been randomized in the two surface loops. Thus, FN3 can be used at least as a
scaffold for engineering novel binding proteins. (Koide A, Bailey CW, Huang X, Koide S., "The
fibronectin type III domain as a scaffold for novel binding proteins." J Mol Biol 1998: 284, 1141-1151,
which is herein incorporated by reference at least for material related to the fibronectin based
novel binding proteins).

Using methodology well known to those of skill in the art, in combination with various
combinatorial libraries, one can isolate and characterize those small molecules or macromolecules
which bind to or interact with the desired target. The relative binding affinity of these compounds
can be compared and optimum compounds identified using competitive binding studies, which are
well known to those of skill in the art.

Techniques for making combinatorial libraries and screening combinatorial libraries to
isolate molecules which bind a desired target are well known to those of skill in the art.
Representative techniques and methods can be found in but are not limited to United States patents
5,084,824, 5,288,514, 5,449,754, 5,506,337, 5,539,083, 5,545,568, 5,556,762, 5,565,324,
5,565,332, 5,573,905, 5,618,825, 5,619,680, 5,627,210, 5,646,285, 5,663,046, 5,670,326,
5,677,195, 5,683,899, 5,688,696, 5,688,997, 5,698,685, 5,712,146, 5,721,099, 5,723,598,
5,741,713, 5,792,431, 5,807,683, 5,807,754, 5,821,130, 5,831,014, 5,834,195, 5,834,318,
5,834,588, 5,840,500, 5,847,150, 5,856,107, 5,856,496, 5,859,190, 5,864,010, 5,874,443,
5,877,214, 5,880,972, 5,886,126, 5,886,127, 5,891,737, 5,916,899, 5,919,955, 5,925,527,
5,939,268, 5,942,387, 5,945,070, 5,948,696, 5,958,702, 5,958,792, 5,962,337, 5,965,719,
5,972,719, 5,976,894, 5,980,704, 5,985,356, 5,999,086, 6,001,579, 6,004,617, 6,008,321,
6,017,768, 6,025,371, 6,030,917, 6,040,193, 6,045,671, 6,045,755, 6,060,596, and 6,061,636.

Combinatorial libraries can be made from a wide array of molecules using a number of
different synthetic techniques. For example, libraries containing fused 2,4-pyrimidinediones
(United States patent 6,025,371), dihydrobenzopyrans (United States Patent 6,017,768 and
5,821,130), amide alcohols (United States Patent 5,976,894), hydroxy-amino acid amides (United
States Patent 5,972,719), carbohydrates (United States patent 5,965,719), 1,4-benzodiazepin-2,5-diones
(United States patent 5,962,337), cyclics (United States patent 5,958,792), biaryl amino
acid amides (United States patent 5,948,696), thiophenes (United States patent 5,942,387),
tricyclic tetrahydroquinolines (United States patent 5,925,527), benzofurans (United States patent
5,919,955), isoquinolines (United States patent 5,916,899), hydantoin and thiohydantoin (United
States patent 5,859,190), indoles (United States patent 5,856,496), imidazol-pyrido-indole and
imidazol-pyrido-benzothiophenes (United States patent 5,856,107), substituted
2-methylene-2,3-dihydrothiazoles (United States patent 5,847,150), quinolines (United States patent 5,840,500),
PNA (United States patent 5,831,014), containing tags (United States patent 5,721,099),
polyketides (United States patent 5,712,146), morpholino-subunits (United States patent
5,698,685 and 5,506,337), sulfamides (United States patent 5,618,825), and benzodiazepines
(United States patent 5,288,514).

Screening molecules similar to alpha6 for inhibition of alpha6beta4 formation is a method
of identifying and isolating desired compounds that can inhibit the formation of A6B4 receptor.
For example, the disclosed compositions, such as alpha6 integrin or beta4 integrin, can be used as
targets in a selection scheme disclosed herein, and then the counterpart integrin could be used as a
competitive inhibitor to isolate the desired molecules. For example, a library of molecules could
be incubated with beta4 integrin, which is bound to a solid support. The solid support can be
washed to remove the unbound molecules and then the solid support can be incubated with, for
example, alpha6 integrin at a concentration that will saturate all beta4 binding sites. The
molecules which are collected in the flowthrough after washing the solid support will be enriched
for molecules that interact with beta4 integrin in a way that is competitive to the alpha6-beta4
interaction. Likewise, the solid support, bound with a target integrin, or more preferably a target
integrin receptor, such as alpha6beta4 receptor, could also be washed with, for example, laminin5
or the gamma2 subunit of laminin5 at a concentration that will saturate all of the gamma2 binding
sites on the beta4 integrin. Collection of the wash under these conditions will yield a population of
molecules enriched for molecules that competitively interact with beta4 integrin at the
beta4-gamma2 site. Another example is the following: bind target to a solid support on a microtiter
plate; incubate with ligand in the presence of a gridded subset of library members (or single
compounds); wash; and identify competitors by reduction of ligand binding. It is understood that
the exemplary discussions of alpha6beta4 and/or beta4 are equally applicable to alpha6, as well as
other alpha6betaX receptors, such as alpha6beta1.
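
The microtiter-plate example above, in which competitors are identified by reduction of ligand binding, can be scored with a simple percent-inhibition calculation. The sketch below is illustrative only: the well labels, signal values, background correction, and the 50% cutoff are hypothetical assumptions and are not specified in this disclosure.

```python
def percent_inhibition(signal, control_signal, background=0.0):
    """Reduction of ligand binding relative to a no-competitor control."""
    window = control_signal - background
    if window <= 0:
        raise ValueError("control signal must exceed background")
    return 100.0 * (1.0 - (signal - background) / window)


def find_competitors(well_signals, control_signal, background=0.0, cutoff=50.0):
    """Wells whose library member reduces ligand binding by at least cutoff %."""
    return sorted(
        well
        for well, signal in well_signals.items()
        if percent_inhibition(signal, control_signal, background) >= cutoff
    )


# Hypothetical plate readout: ligand binding signal per gridded library well.
wells = {"A1": 0.95, "A2": 0.30, "A3": 0.88, "A4": 0.12}
hits = find_competitors(wells, control_signal=1.00, background=0.05)
print(hits)
```

Wells flagged this way would normally be retested, since a reduced signal can also arise from effects unrelated to competition at the binding site.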

Also disclosed are methods of isolating molecules that bind with a target molecule
selected from the group consisting of beta4 integrin, alpha6 integrin, and the gamma2 subunit of
laminin5, comprising 1) contacting a library of molecules with the target molecule and 2)
collecting molecules that bind the target molecule, producing an enriched population of molecules.

Disclosed are methods, further comprising the step of repeating steps 1 and 2 with the 
enriched population of molecules, and/or wherein the library comprises a small molecule, peptide, 
peptide mimetic, or oligonucleotide. 
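
Repeating steps 1 and 2 compounds the enrichment from each round. The sketch below models this under a deliberately simple assumption: every round multiplies the odds of binders to non-binders by a fixed factor. The 100-fold factor per round is hypothetical; the starting frequency of 1 in 10^10 echoes the Ellington and Szostak estimate cited earlier. Real selections deviate from this idealized model.

```python
def enrich(fraction, factor):
    """One round of selection: odds of binders are multiplied by `factor`."""
    odds = fraction / (1.0 - fraction) * factor
    return odds / (1.0 + odds)


def rounds_to_reach(start_fraction, factor, goal, max_rounds=1000):
    """Number of rounds needed for binders to reach `goal` of the population."""
    if not 0.0 < start_fraction < 1.0 or not 0.0 < goal < 1.0:
        raise ValueError("fractions must lie strictly between 0 and 1")
    fraction, rounds = start_fraction, 0
    while fraction < goal:
        if rounds >= max_rounds:
            raise RuntimeError("goal not reached; is the factor greater than 1?")
        fraction = enrich(fraction, factor)
        rounds += 1
    return rounds


# Hypothetical: binders start at 1 in 10^10 and each round enriches 100 fold.
needed = rounds_to_reach(start_fraction=1e-10, factor=100.0, goal=0.9)
print(f"rounds needed: {needed}")
```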

As used herein, combinatorial methods and libraries include traditional screening methods
and libraries as well as methods and libraries used in iterative processes.

(2) Computer assisted design 

The disclosed compositions can be used as targets for any molecular modeling technique
to identify either the structure of the disclosed compositions or to identify potential or actual
molecules, such as small molecules, which interact in a desired way with the disclosed
compositions. The nucleic acids, peptides, and related molecules disclosed herein can be used as
targets in any molecular modeling program or approach.

It is understood that when using the disclosed compositions in modeling techniques,
molecules, such as macromolecular molecules, will be identified that have particular desired
properties such as inhibition or stimulation of the target molecule's function. The molecules
identified and isolated when using the disclosed compositions, such as beta4, are also disclosed.
Thus, the products produced using the molecular modeling approaches that involve the disclosed
compositions, such as beta4 integrin, alpha6 integrin, or laminin5, are also considered herein
disclosed.

Thus, one way to isolate molecules that bind a molecule of choice is through rational
design. This is achieved through structural information and computer modeling. Computer
modeling technology allows visualization of the three-dimensional atomic structure of a selected
molecule and the rational design of new compounds that will interact with the molecule. The
three-dimensional construct typically depends on data from x-ray crystallographic analyses or
NMR imaging of the selected molecule. The molecular dynamics require force field data. The
computer graphics systems enable prediction of how a new compound will link to the target
molecule and allow experimental manipulation of the structures of the compound and target
molecule to perfect binding specificity. Prediction of what the molecule-compound interaction
will be when small changes are made in one or both requires molecular mechanics software and
computationally intensive computers, usually coupled with user-friendly, menu-driven interfaces
between the molecular design program and the user.

Examples of molecular modeling systems are the CHARMm and QUANTA programs,
Polygen Corporation, Waltham, MA. CHARMm performs the energy minimization and molecular
dynamics functions. QUANTA performs the construction, graphic modeling and analysis of
molecular structure. QUANTA allows interactive construction, modification, visualization, and
analysis of the behavior of molecules with each other.

A number of articles review computer modeling of drugs interactive with specific proteins,
such as Rotivinen, et al., 1988 Acta Pharmaceutica Fennica 91, 159-166; Ripka, New Scientist 54-57
(June 16, 1988); McKinaly and Rossmann, 1989 Annu. Rev. Pharmacol. Toxicol. 29, 111-122;
Perry and Davies, QSAR: Quantitative Structure-Activity Relationships in Drug Design pp.
189-193 (Alan R. Liss, Inc. 1989); Lewis and Dean, 1989 Proc. R. Soc. Lond. 236, 125-140
and 141-162; and, with respect to a model enzyme for nucleic acid components, Askew, et al.,
1989 J. Am. Chem. Soc. 111, 1082-1090. Other computer programs that screen and graphically
depict chemicals are available from companies such as BioDesign, Inc., Pasadena, CA., Allelix,
Inc., Mississauga, Ontario, Canada, and Hypercube, Inc., Cambridge, Ontario. Although these are
primarily designed for application to drugs specific to particular proteins, they can be adapted to
design of molecules specifically interacting with specific regions of DNA or RNA, once that
region is identified.

Although described above with reference to design and generation of compounds which
could alter binding, one could also screen libraries of known compounds, including natural
products or synthetic chemicals, and biologically active materials, including proteins, for
compounds which alter substrate binding or enzymatic activity.

c) Nucleic acids 

There are a variety of molecules disclosed herein that are nucleic acid based, including, for
example, the nucleic acids that encode beta4 and the gamma2 subunit of laminin5, as
well as various functional nucleic acids. The disclosed nucleic acids are made up of, for example,
nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and
other molecules are discussed herein. It is understood that, for example, when a vector is expressed
in a cell, the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is
understood that if, for example, an antisense molecule is introduced into a cell or cell environment
through, for example, exogenous delivery, it is advantageous that the antisense molecule be made
up of nucleotide analogs that reduce the degradation of the antisense molecule in the cellular
environment.

(1) Nucleotides and related molecules 
A nucleotide is a molecule that contains a base moiety, a sugar moiety and a phosphate 
moiety. Nucleotides can be linked together through their phosphate moieties and sugar moieties 
creating an internucleoside linkage. The base moiety of a nucleotide can be adenin-9-yl (A), 
cytosin-1-yl (C), guanin-9-yl (G), uracil-1-yl (U), and thymin-1-yl (T). The sugar moiety of a 
nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide is pentavalent 
phosphate. A non-limiting example of a nucleotide would be 3'-AMP (3'-adenosine 
monophosphate) or 5'-GMP (5'-guanosine monophosphate). 

A nucleotide analog is a nucleotide which contains some type of modification to either the 
base, sugar, or phosphate moieties. Modifications to the base moiety would include natural and 
synthetic modifications of A, C, G, and T/U as well as different purine or pyrimidine bases, such 
as uracil-5-yl (ψ), hypoxanthin-9-yl (I), and 2-aminoadenin-9-yl. A modified base includes but 
is not limited to 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 
2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other 
alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 
5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 
5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 
8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 
5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 
8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. 
Additional base modifications can be found for example in U.S. Pat. No. 3,687,808, Englisch et 
al., Angewandte Chemie, International Edition, 1991, 30, 613, and Sanghvi, Y. S., Chapter 15, 
Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B. ed., CRC 
Press, 1993. Certain nucleotide analogs, such as 5-substituted pyrimidines, 6-azapyrimidines and 
N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil, 
5-propynylcytosine, and 5-methylcytosine, can increase the stability of duplex formation. Base 
modifications can be combined, for example, with a sugar modification, such as 
2'-O-methoxyethyl, to achieve unique properties such as increased duplex stability. There are 
numerous United States patents such as 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 
5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 
5,594,121; 5,596,091; 5,614,617; and 5,681,941, which detail and describe a range of base 
modifications. Each of these patents is herein incorporated by reference. 

Nucleotide analogs can also include modifications of the sugar moiety. Modifications to 
the sugar moiety would include natural modifications of the ribose and deoxyribose as well as 
synthetic modifications. Sugar modifications include but are not limited to the following 
modifications at the 2' position: OH-; F-; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or 
N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or 
unsubstituted C1 to C10 alkyl or C2 to C10 alkenyl and alkynyl. 2' sugar modifications also include 
but are not limited to -O[(CH2)nO]mCH3, -O(CH2)nOCH3, -O(CH2)nNH2, -O(CH2)nCH3, 
-O(CH2)n-ONH2, and -O(CH2)nON[(CH2)nCH3]2, where n and m are from 1 to about 10. 

Other modifications at the 2' position include but are not limited to: C1 to C10 lower alkyl, 
substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, 
OCF3, SOCH3, SO2CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, 
aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an 
intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a 
group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents 
having similar properties. Similar modifications may also be made at other positions on the sugar, 
particularly the 3' position of the sugar on the 3' terminal nucleotide or in 2'-5' linked 
oligonucleotides and the 5' position of 5' terminal nucleotide. Modified sugars would also include 
those that contain modifications at the bridging ring oxygen, such as CH2 and S. Nucleotide sugar 
analogs may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl 
sugar. There are numerous United States patents that teach the preparation of such modified sugar 
structures such as 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 
5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 
5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, each of which is herein incorporated 
by reference in its entirety. 

Nucleotide analogs can also be modified at the phosphate moiety. Modified phosphate 
moieties include but are not limited to those that can be modified so that the linkage between two 
nucleotides contains a phosphorothioate, chiral phosphorothioate, phosphorodithioate, 
phosphotriester, aminoalkylphosphotriester, methyl and other alkyl phosphonates including 
3'-alkylene phosphonate and chiral phosphonates, phosphinates, phosphoramidates including 
3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, 
thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates. It is understood 
that these phosphate or modified phosphate linkages between two nucleotides can be through a 
3'-5' linkage or a 2'-5' linkage, and the linkage can contain inverted polarity such as 3'-5' to 5'-3' 
or 2'-5' to 5'-2'. Various salts, mixed salts and free acid forms are also included. Numerous United 
States patents teach how to make and use nucleotides containing modified phosphates and include 
but are not limited to, 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 
5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 
5,587,361; and 5,625,050, each of which is herein incorporated by reference. 

It is understood that nucleotide analogs need only contain a single modification, but may 
also contain multiple modifications within one of the moieties or between different moieties. 

Nucleotide substitutes are molecules having similar functional properties to nucleotides, 
but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide 
substitutes are molecules that will recognize nucleic acids in a Watson-Crick or Hoogsteen 
manner, but which are linked together through a moiety other than a phosphate moiety. 
Nucleotide substitutes are able to conform to a double helix type structure when interacting with 
the appropriate target nucleic acid. 

Nucleotide substitutes are nucleotides or nucleotide analogs that have had the phosphate 
moiety and/or sugar moieties replaced. Nucleotide substitutes do not contain a standard 
phosphorus atom. Substitutes for the phosphate can be for example, short chain alkyl or 
cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside 
linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These 
include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); 
siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl 
backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; 
sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and 
sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component 
parts. Numerous United States patents disclose how to make and use these types of phosphate 
replacements and include but are not limited to 5,034,506; 5,166,315; 5,185,444; 5,214,134; 
5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 
5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 
5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which 
is herein incorporated by reference. 

It is also understood in a nucleotide substitute that both the sugar and the phosphate 
moieties of the nucleotide can be replaced, by for example an amide type linkage 
(aminoethylglycine) (PNA). United States patents 5,539,082; 5,714,331; and 5,719,262 teach how 
to make and use PNA molecules, each of which is herein incorporated by reference. (See also 
Nielsen et al., Science, 1991, 254, 1497-1500). 

It is also possible to link other types of molecules (conjugates) to nucleotides or nucleotide 
analogs to enhance for example, cellular uptake. Conjugates can be chemically linked to the 
nucleotide or nucleotide analogs. Such conjugates include but are not limited to lipid moieties 
such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 
86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), 
a thioether, e.g., hexyl-S-tritylthiol (Manoharan et al., Ann. N.Y. Acad. Sci., 1992, 660, 
306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol 
(Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g., dodecandiol or 
undecyl residues (Saison-Behmoaras et al., EMBO J., 1991, 10, 1111-1118; Kabanov et al., FEBS 
Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49-54), a phospholipid, e.g., 
di-hexadecyl-rac-glycerol or triethylammonium 1,2-di-O-hexadecyl-rac-glycero-3-H-phosphonate 
(Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res., 1990, 
18, 3777-3783), a polyamine or a polyethylene glycol chain (Manoharan et al., Nucleosides & 
Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 
1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 
229-237), or an octadecylamine or hexylamino-carbonyl-oxycholesterol moiety (Crooke et al., J. 
Pharmacol. Exp. Ther., 1996, 277, 923-937). Numerous United States patents teach the 
preparation of such conjugates and include, but are not limited to U.S. Pat. Nos. 4,828,979; 
4,948,882; 5,218,105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717; 5,580,731; 
5,580,731; 5,591,584; 5,109,124; 5,118,802; 5,138,045; 5,414,077; 5,486,603; 5,512,439; 
5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941; 
4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 
5,112,963; 5,214,136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 
5,317,098; 5,371,241; 5,391,723; 5,416,203; 5,451,463; 5,510,475; 5,512,667; 5,514,785; 
5,565,552; 5,567,810; 5,574,142; 5,585,481; 5,587,371; 5,595,726; 5,597,696; 5,599,923; 
5,599,928 and 5,688,941, each of which is herein incorporated by reference. 

It is understood that an oligonucleotide can be made from any combination of nucleotides, 
nucleotide analogs, or nucleotide substitutes disclosed herein or related molecules not specifically 
recited herein. 

A Watson-Crick interaction is at least one interaction with the Watson-Crick face of a 
nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a nucleotide, 
nucleotide analog, or nucleotide substitute includes the C2, N1, and C6 positions of a purine based 
nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4 positions of a 
pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute. 

A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face of a 
nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The 
Hoogsteen face includes the N7 position and reactive groups (NH2 or O) at the C6 position of 
purine nucleotides. 

(2) Sequences 

There are a variety of sequences related to the beta4 integrin or the laminin5-gamma2 gene 
or the alpha6 integrin gene having the following Genbank Accession Numbers 6453379, 4557674, 
and AH006634, respectively. These sequences and others are herein incorporated by reference in 
their entireties as well as for individual subsequences contained therein. 

One particular sequence set forth in SEQ ID NO:1 (beta4 integrin) and having Genbank 
accession number 6453379 is used herein, at various points, as an example, to exemplify the 
disclosed compositions and methods (or when another particular sequence is used as an example). 
It is understood that the description related to this sequence is applicable to any sequence related to 
beta4 integrin or any of the other molecules disclosed herein, such as alpha6 integrin, or the 
subunits of laminin5, such as gamma2, unless specifically indicated otherwise. Those of skill in 
the art understand how to resolve sequence discrepancies and differences and to adjust the 
compositions and methods relating to a particular sequence to other related sequences (i.e., 
sequences of beta4 integrin). Primers and/or probes can be designed for any beta4 integrin 
sequence given the information disclosed herein and known in the art. 

(3) Primers and probes 
Disclosed are compositions including primers and probes, which are capable of interacting 
with, for example, the alpha6 gene or mRNA, beta4 gene or mRNA, or gamma2 subunit of the 
laminin5 ligand gene as disclosed herein or mRNA, as well as primers or probes for any of the 
sequences or fragments of the sequences, set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 11, and 13. In certain 
embodiments the primers are used to support DNA amplification reactions. Typically the primers 
will be capable of being extended in a sequence specific manner. Extension of a primer in a 
sequence specific manner includes any methods wherein the sequence and/or composition of the 
nucleic acid molecule to which the primer is hybridized or otherwise associated directs or 
influences the composition or sequence of the product produced by the extension of the primer. 
Extension of the primer in a sequence specific manner therefore includes, but is not limited to, 
PCR, DNA sequencing, DNA extension, DNA polymerization, RNA transcription, or reverse 
transcription. Techniques and conditions that amplify the primer in a sequence specific manner 
are preferred. In certain embodiments the primers are used for the DNA amplification reactions, 
such as PCR or direct sequencing. It is understood that in certain embodiments the primers can 
also be extended using non-enzymatic techniques, where for example, the nucleotides or 
oligonucleotides used to extend the primer are modified such that they will chemically react to 
extend the primer in a sequence specific manner. Typically the disclosed primers hybridize with 
the beta4 gene, alpha6 gene, or laminin5-gamma2 subunit gene, for example, or region of the 
beta4 gene, alpha6 gene, or laminin5-gamma2 subunit gene, for example, or they hybridize with 
the complement of the beta4 gene, alpha6 gene, or laminin5-gamma2 subunit gene, for example, or 
complement of a region of the beta4 gene, alpha6 gene, or laminin5-gamma2 subunit gene, or any 
of the sequences or fragments of the sequences, set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 11, and 13, for 
example. 

The size of the primers or probes for interaction with the beta4 gene, alpha6 gene, or 
laminin5-gamma2 subunit gene, for example, in certain embodiments can be any size that supports 
the desired enzymatic manipulation of the primer, such as DNA amplification or the simple 
hybridization of the probe or primer. A typical primer or probe for beta4 gene, alpha6 gene, or 
laminin5-gamma2 subunit gene, for example, or primer or probe for any of the sequences or 
fragments of the sequences, set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 11, and 13 would be at least about 
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 
34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 
325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 1250, 
1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

In other embodiments a primer or probe for beta4 gene, alpha6 gene, or laminin5-gamma2 
subunit gene, for example, or a primer or probe for any of the sequences or 
fragments of the sequences, set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 11, and 13 can be less than or equal 
to about 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 
57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 
83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 
275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 
1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

The primers for the beta4 gene, alpha6 gene, or laminin5-gamma2 subunit gene, or any of 
the sequences or fragments of the sequences, set forth in SEQ ID NOs: 1, 3, 5, 7, 9, 11, and 13, for 
example, typically will be used to produce an amplified DNA product that contains a desired 
region. In general, the size of the product will be such that the size can be accurately 
determined to within 3, or 2 or 1 nucleotides. 

In certain embodiments this product is at least about 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 
56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 
250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 
1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

In other embodiments the product is less than or equal to about 20, 21, 22, 23, 24, 25, 26, 
27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 
53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 
200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 550, 600, 650, 700, 750, 800, 850, 
900, 950, 1000, 1250, 1500, 1750, 2000, 2250, 2500, 2750, 3000, 3500, or 4000 nucleotides long. 

d) Sequence similarities 

It is understood that, as discussed herein, the terms homology and identity mean 
the same thing as similarity. Thus, for example, if the word homology is used between two 
non-natural sequences it is understood that this is not necessarily indicating an evolutionary 
relationship between these two sequences, but rather is looking at the similarity or relatedness 
between their nucleic acid sequences. Many of the methods for determining homology between 
two evolutionarily related molecules are routinely applied to any two or more nucleic acids or 
proteins for the purpose of measuring sequence similarity regardless of whether or not they are 
evolutionarily related. 

In general, it is understood that one way to define any known variants and derivatives or 
those that might arise, of the disclosed genes and proteins herein, is through defining the variants 
and derivatives in terms of homology to specific known sequences. This identity of particular 
sequences disclosed herein is also discussed elsewhere herein. In general, variants of genes and 
proteins herein disclosed typically have at least about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 
82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to the 
stated sequence or the native sequence. Those of skill in the art readily understand how to 
determine the homology of two proteins or nucleic acids, such as genes. For example, the 
homology can be calculated after aligning the two sequences so that the homology is at its highest 
level. 
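
As a minimal sketch of this calculation, percent homology (in the sense of identity) can be computed from two sequences that have already been aligned. The function name `percent_identity`, the use of `-` as the gap character, and the choice of the number of alignment columns as the denominator are assumptions made for illustration only; other denominator conventions exist and would give different percentages.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumption: both strings are already aligned (equal length, gaps
    written as `gap`). Columns in which both sequences have a gap are
    ignored; a column with a gap in only one sequence counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == gap and y == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)
```

Under these assumptions, a variant would satisfy, for example, an 80 percent threshold when `percent_identity(...) >= 80.0` for the alignment that maximizes the value.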

Another way of calculating homology can be performed by published algorithms. Optimal 
alignment of sequences for comparison may be conducted by the local homology algorithm of 
Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of 
Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity method of 
Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by 
inspection. 
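
For illustration, a global alignment in the style of Needleman and Wunsch can be sketched in a few lines of dynamic programming. This is not the GAP or BESTFIT implementation referred to above; the function name `needleman_wunsch` and the scoring values (match +1, mismatch -1, linear gap penalty -1) are assumptions chosen only to keep the example small.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Return (score, aligned_a, aligned_b) for one optimal global alignment.

    Assumed scoring: +1 match, -1 mismatch, -1 per gap position.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
    # Trace back from the bottom-right corner to recover one alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

The aligned strings returned here can then be compared column by column to obtain a percent homology; a different scoring scheme may yield a different alignment and therefore a different percentage.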

The same types of homology can be obtained for nucleic acids by for example the 
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. 
USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are herein 
incorporated by reference for at least material related to nucleic acid alignment. It is understood 
that any of the methods typically can be used and that in certain instances the results of these 
various methods may differ, but the skilled artisan understands if identity is found with at least one 
of these methods, the sequences would be said to have the stated identity, and be disclosed herein. 

For example, as used herein, a sequence recited as having a particular percent homology to 
another sequence refers to sequences that have the recited homology as calculated by any one or 
more of the calculation methods described above. For example, a first sequence has 80 percent 
homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 
percent homology to the second sequence using the Zuker calculation method even if the first 
sequence does not have 80 percent homology to the second sequence as calculated by any of the 
other calculation methods. As another example, a first sequence has 80 percent homology, as 
defined herein, to a second sequence if the first sequence is calculated to have 80 percent 
homology to the second sequence using both the Zuker calculation method and the Pearson and 
Lipman calculation method even if the first sequence does not have 80 percent homology to the 
second sequence as calculated by the Smith and Waterman calculation method, the Needleman and 
Wunsch calculation method, the Jaeger calculation methods, or any of the other calculation 
methods. As yet another example, a first sequence has 80 percent homology, as defined herein, to 
a second sequence if the first sequence is calculated to have 80 percent homology to the second 
sequence using each of the calculation methods (although, in practice, the different calculation 
methods will often result in different calculated homology percentages). 
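
The "any one or more of the calculation methods" rule can be sketched as a simple test over per-method results. The function name `has_recited_homology`, the method labels used as dictionary keys, and the reading of the recited percentage as a minimum (`>=`) are assumptions for illustration; the percentages themselves would come from whichever calculation methods are actually run.

```python
from typing import Mapping


def has_recited_homology(recited_percent: float, results: Mapping[str, float]) -> bool:
    """True if at least one calculation method reaches the recited percentage.

    `results` maps a method label (hypothetical names such as "zuker" or
    "smith_waterman") to the percent homology that method calculated.
    Assumption: the recited value is treated as a minimum.
    """
    return any(value >= recited_percent for value in results.values())
```

For instance, `has_recited_homology(80, {"zuker": 80.0, "smith_waterman": 72.5})` evaluates to `True`, mirroring the first example above, in which only the Zuker calculation reaches 80 percent.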

It is also understood that functional fragments as well as antigenic fragments as well as 
fragments that can be used in selection protocols of the disclosed compositions are also disclosed. 
For example, integrins have domains that interact with the other integrins. It may be advantageous 
in certain embodiments to utilize just the integrin binding domain fragment of, for example, the 
beta4 integrin, in a selection protocol disclosed herein. By using this domain of the beta4 integrin 
as the selection target, for example, the selection protocol will be biased for molecules binding this 
domain of beta4 integrin. 

e) Hybridization/selective hybridization 

The term hybridization typically means a sequence driven interaction between at least two 
nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven interaction means 
an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives 
in a nucleotide specific manner. For example, G interacting with C or A interacting with T are 
sequence driven interactions. Typically sequence driven interactions occur on the Watson-Crick 
face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a 
number of conditions and parameters known to those of skill in the art. For example, the salt 
concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules 
will hybridize. 
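
The G-C and A-T pairing described above can be illustrated with a short sketch that finds where a primer could pair perfectly with a single-stranded DNA template. The function names `reverse_complement` and `find_binding_sites` are hypothetical, and the sketch assumes unmodified DNA bases and exact Watson-Crick complementarity only; it does not model salt concentration, pH, temperature, or mismatches.

```python
from typing import List

# Watson-Crick partners for the four unmodified DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def find_binding_sites(primer: str, template: str) -> List[int]:
    """0-based start positions on `template` (5' to 3') where `primer`
    (5' to 3') can pair along its full length without mismatches."""
    target = reverse_complement(primer)
    template = template.upper()
    width = len(target)
    return [
        i
        for i in range(len(template) - width + 1)
        if template[i:i + width] == target
    ]
```

A primer binds antiparallel to its target, which is why the search looks for the reverse complement of the primer rather than the primer sequence itself.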

Parameters for selective hybridization between two nucleic acid molecules are well known 
to those of skill in the art. For example, in some embodiments selective hybridization conditions 
can be defined as stringent hybridization conditions. For example, stringency of hybridization is 
controlled by both temperature and salt concentration of either or both of the hybridization and 
washing steps. For example, the conditions of hybridization to achieve selective hybridization 
may involve hybridization in high ionic strength solution (6X SSC or 6X SSPE) at a temperature 
that is about 12-25°C below the Tm (the melting temperature at which half of the molecules 
dissociate from their hybridization partners) followed by washing at a combination of temperature 
and salt concentration chosen so that the washing temperature is about 5°C to 20°C below the Tm. 
The temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a labeled 
nucleic acid of interest and then washed under conditions of different stringencies. Hybridization 
temperatures are typically higher for DNA-RNA and RNA-RNA hybridizations. The conditions 
can be used as described above to achieve stringency, or as is known in the art. (Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989; Kunkel et al. Methods Enzymol. 154:367, 1987 which is herein 
incorporated by reference for material at least related to hybridization of nucleic acids). A 
preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68°C 
(in aqueous solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of 
hybridization and washing, if desired, can be reduced accordingly as the degree of 
complementarity desired is decreased, and further, depending upon the G-C or A-T richness of any 
area wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
desired, can be increased accordingly as homology desired is increased, and further, depending 
upon the G-C or A-T richness of any area wherein high homology is desired, all as known in the 
art. 
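
As a rough sketch of how the temperature windows above relate to the Tm, the following estimates a Tm and derives the hybridization window (about 12-25°C below the Tm) and the washing window (about 5°C to 20°C below the Tm). The Tm estimate uses the Wallace rule, Tm = 2°C x (A+T) + 4°C x (G+C), which is an assumption introduced here: it is a rule of thumb for short oligonucleotides in high salt, is not taken from this disclosure, and does not replace the empirical determination described above. The function names are hypothetical.

```python
from typing import Tuple


def wallace_tm(oligo: str) -> float:
    """Rule-of-thumb Tm in °C for a short DNA oligonucleotide (Wallace rule).

    Assumption: 2 °C per A or T and 4 °C per G or C; only a rough
    estimate for short oligonucleotides in high-salt buffer.
    """
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2.0 * at + 4.0 * gc


def stringency_windows(tm: float) -> Tuple[Tuple[float, float], Tuple[float, float]]:
    """Return ((hyb_low, hyb_high), (wash_low, wash_high)) in °C.

    Hybridization: about 12-25 °C below the Tm.
    Washing: about 5-20 °C below the Tm.
    """
    hybridization = (tm - 25.0, tm - 12.0)
    washing = (tm - 20.0, tm - 5.0)
    return hybridization, washing
```

Because G-C pairs contribute more to the estimate than A-T pairs, a G-C rich region shifts both windows upward, consistent with the dependence on G-C or A-T richness noted above.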

Another way to define selective hybridization is by looking at the amount (percentage) of 
one of the nucleic acids bound to the other nucleic acid. For example, in some embodiments 
selective hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 
77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent 
of the limiting nucleic acid is bound to the non-limiting nucleic acid. Typically, the non-limiting 
primer is in, for example, 10 or 100 or 1000 fold excess. This type of assay can be performed 
under conditions where both the limiting and non-limiting primer are, for example, 10 fold or 100 
fold or 1000 fold below their kd, or where only one of the nucleic acid molecules is 10 fold or 100 
fold or 1000 fold or where one or both nucleic acid molecules are above their kd. 

Another way to define selective hybridization is by looking at the percentage of primer 
that gets enzymatically manipulated under conditions where hybridization is required to promote 
the desired enzymatic manipulation. For example, in some embodiments selective hybridization 
conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 
83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer is 
enzymatically manipulated under conditions which promote the enzymatic manipulation. For 
example, if the enzymatic manipulation is DNA extension, then selective hybridization conditions 
would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the primer molecules are 
extended. Preferred conditions also include those suggested by the manufacturer or indicated in 
the art as being appropriate for the enzyme performing the manipulation. 

Just as with homology, it is understood that there are a variety of methods herein disclosed 
for determining the level of hybridization between two nucleic acid molecules. It is understood 
that these methods and conditions may provide different percentages of hybridization between two 
nucleic acid molecules, but unless otherwise indicated meeting the parameters of any of the 
methods would be sufficient. For example, if 80% hybridization was required, then as long as 
hybridization occurs within the required parameters in any one of these methods it is considered 
disclosed herein. 

It is understood that those of skill in the art understand that if a composition or method 
meets any one of these criteria for determining hybridization either collectively or singly it is a 
composition or method that is disclosed herein. 

f) Delivery of the compositions to cells 

The disclosed compositions and methods often entail delivery of the compositions to cells. 
For example, antisense molecules directed to alpha6 mRNA or gamma2 mRNA can be delivered 
to cells via any method. A number of exemplary methods are disclosed herein. It is also 
understood that in certain embodiments, non-nucleic acid molecules will be and can be delivered 
to cells, for example an antibody to beta4 integrin, alpha6 integrin, or gamma2, or a small 
molecule, or a peptide. Delivery of these molecules can occur by any means, and exemplary 
compositions and methods for such delivery are disclosed herein. 

There are a number of compositions and methods which can be used to deliver nucleic 
acids to cells, either in vitro or in vivo. These methods and compositions can largely be broken 
down into two classes: viral based delivery systems and non-viral based delivery systems. For 
example, the nucleic acids can be delivered through a number of direct delivery systems such as 
electroporation, lipofection, calcium phosphate precipitation, plasmids, viral vectors, viral nucleic 
acids, phage nucleic acids, phages, cosmids, or via transfer of genetic material in cells or carriers 
such as cationic liposomes. Appropriate means for transfection, including viral vectors, chemical 
transfectants, or physico-mechanical methods such as electroporation and direct diffusion of DNA, 
are described by, for example, Wolff, J. A., et al., Science, 247, 1465-1468, (1990); and Wolff, J. 
A. Nature, 352, 815-818, (1991). Such methods are well known in the art and readily adaptable for 
use with the compositions and methods described herein. In certain cases, the methods will be 
modified to specifically function with large DNA molecules. Further, these methods can be used 
to target certain diseases and cell populations by using the targeting characteristics of the carrier. 

(1) Nucleic acid based delivery systems 
Transfer vectors can be any nucleotide construction used to deliver genes into cells (e.g., a
plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or
adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)).

As used herein, plasmid or viral vectors are agents that transport the disclosed nucleic
acids, such as gamma2 antisense producing molecules, into the cell without degradation and
include a promoter yielding expression of the gene in the cells into which it is delivered. In some
embodiments the delivery vectors are derived from either a virus or a retrovirus. Viral vectors are,
for example, Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS
virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV
backbone. Also preferred are any viral families which share the properties of these viruses which
make them suitable for use as vectors. Retroviruses include Moloney murine leukemia virus
(MMLV) and retroviruses that express the desirable properties of MMLV as a vector. Retroviral
vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral
vectors, and for this reason are a commonly used vector. However, other than Lentivirus vectors,
they are typically not as useful in non-proliferating cells. Adenovirus vectors are relatively stable
and easy to work with, have high titers, can be delivered in aerosol formulation, and can
transfect non-dividing cells. Pox viral vectors are large and have several sites for inserting genes;
they are thermostable and can be stored at room temperature. A preferred embodiment is a viral
vector which has been engineered so as to suppress the immune response of the host organism
elicited by the viral antigens. Preferred vectors of this type will carry coding regions for
Interleukin 8 or 10.

Viral vectors can have higher transduction abilities (ability to introduce genes) than
chemical or physical methods to introduce genes into cells. Typically, viral vectors contain
nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted
terminal repeats necessary for replication and encapsidation, and promoters to control the
transcription and replication of the viral genome. When engineered as vectors, viruses typically
have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into
the viral genome in place of the removed viral DNA. Constructs of this type can carry up to about
8 kb of foreign genetic material. The necessary functions of the removed early genes are typically
supplied by cell lines which have been engineered to express the gene products of the early genes
in trans.

(a) Retroviral Vectors
A retrovirus is an animal virus belonging to the virus family of Retroviridae, including any
types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are described by Verma,
I.M., Retroviral vectors for gene transfer. In Microbiology-1985, American Society for
Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein.
Examples of methods for using retroviral vectors for gene therapy are described in U.S. Patent
Nos. 4,868,116 and 4,980,286; PCT applications WO 90/02806 and WO 89/07136; and Mulligan,
(Science 260:926-932 (1993)); the teachings of which are incorporated herein by reference.

A retrovirus is essentially a package which has packed into it nucleic acid cargo. The
nucleic acid cargo carries with it a packaging signal, which ensures that the replicated daughter
molecules will be efficiently packaged within the package coat. In addition to the packaging signal,
there are a number of molecules which are needed in cis for the replication and packaging of the
replicated virus. Typically a retroviral genome contains the gag, pol, and env genes, which are
involved in the making of the protein coat. It is the gag, pol, and env genes which are typically
replaced by the foreign DNA that is to be transferred to the target cell. Retrovirus vectors
typically contain a packaging signal for incorporation into the package coat, a sequence which
signals the start of the gag transcription unit, elements necessary for reverse transcription,
including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat
sequences that guide the switch of RNA strands during DNA synthesis, a purine rich sequence 5' to
the 3' LTR that serves as the priming site for the synthesis of the second strand of DNA,
and specific sequences near the ends of the LTRs that enable the DNA state of the
retrovirus to insert into the host genome. The removal of the gag, pol, and env genes allows for
about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed,
and upon replication be packaged into a new retroviral particle. This amount of nucleic acid is
sufficient for the delivery of one to many genes depending on the size of each transcript. It is
preferable to include either positive or negative selectable markers along with other genes in the
insert.

Since the replication machinery and packaging proteins in most retroviral vectors have
been removed (gag, pol, and env), the vectors are typically generated by placing them into a
packaging cell line. A packaging cell line is a cell line which has been transfected or transformed
with a retrovirus that contains the replication and packaging machinery, but lacks any packaging
signal. When the vector carrying the DNA of choice is transfected into these cell lines, the vector
containing the gene of interest is replicated and packaged into new retroviral particles by the
machinery provided in trans by the helper cell. The genomes for the machinery are not packaged
because they lack the necessary signals.

(b) Adenoviral Vectors

The construction of replication-defective adenoviruses has been described (Berkner et al.,
J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell. Biol. 6:2872-2883 (1986); Haj-Ahmad
et al., J. Virology 57:267-274 (1986); Davidson et al., J. Virology 61:1226-1239 (1987);
Zhang "Generation and identification of recombinant adenovirus by liposome-mediated
transfection and PCR analysis" BioTechniques 15:868-872 (1993)). The benefit of the use of
these viruses as vectors is that they are limited in the extent to which they can spread to other cell
types, since they can replicate within an initial infected cell, but are unable to form new infectious
viral particles. Recombinant adenoviruses have been shown to achieve high efficiency gene
transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium,
CNS parenchyma and a number of other tissue sites (Morsy, J. Clin. Invest. 92:1580-1586
(1993); Kirshenbaum, J. Clin. Invest. 92:381-387 (1993); Roessler, J. Clin. Invest. 92:1085-1092
(1993); Moullier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993);
Gomez-Foix, J. Biol. Chem. 267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476
(1993); Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207
(1993); Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud,
Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-507 (1993)).
Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors,
after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild
type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970);
Brown and Burlingham, J. Virology 12:386-396 (1973); Svensson and Persson, J. Virology
55:442-449 (1985); Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., Mol. Cell. Biol.
4:1528-1533 (1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell
73:309-319 (1993)).

A viral vector can be one based on an adenovirus which has had the E1 gene removed, and
these virions are generated in a cell line such as the human 293 cell line. In another preferred
embodiment both the E1 and E3 genes are removed from the adenovirus genome.

(c) Adeno-associated viral vectors 

Another type of viral vector is based on an adeno-associated virus (AAV). This defective
parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to
humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably
insert into chromosome 19. Vectors which contain this site specific integration property are
preferred. An especially preferred embodiment of this type of vector is the P4.1 C vector produced
by Avigen, San Francisco, CA, which can contain the herpes simplex virus thymidine kinase gene,
HSV-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP.

In another type of AAV virus, the AAV contains a pair of inverted terminal repeats (ITRs)
which flank at least one cassette containing a promoter which directs cell-specific expression
operably linked to a heterologous gene. Heterologous in this context refers to any nucleotide
sequence or gene which is not native to the AAV or B19 parvovirus.

Typically the AAV and B19 coding regions have been deleted, resulting in a safe,
noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and site-specific
integration, but not cytotoxicity, and the promoter directs cell-specific expression. United States
Patent No. 6,261,834 is herein incorporated by reference for material related to the AAV vector.

The vectors of the present invention thus provide DNA molecules which are capable of
integration into a mammalian chromosome without substantial toxicity.

The inserted genes in viral and retroviral systems usually contain promoters and/or enhancers to
help control the expression of the desired gene product. A promoter is generally a sequence or
sequences of DNA that function when in a relatively fixed location in regard to the transcription
start site. A promoter contains core elements required for basic interaction of RNA polymerase
and transcription factors, and may contain upstream elements and response elements.

(d) Large payload viral vectors
Molecular genetic experiments with large human herpesviruses have provided a means
whereby large heterologous DNA fragments can be cloned, propagated and established in cells
permissive for infection with herpesviruses (Sun et al., Nature Genetics 8: 33-41, 1994; Cotter and
Robertson, Curr Opin Mol Ther 5: 633-644, 1999). These large DNA viruses (herpes simplex
virus (HSV) and Epstein-Barr virus (EBV)) have the potential to deliver fragments of human
heterologous DNA > 150 kb to specific cells. EBV recombinants can maintain large pieces of
DNA in the infected B-cells as episomal DNA. Individual clones carrying human genomic inserts
up to 330 kb appeared genetically stable. The maintenance of these episomes requires a specific
EBV nuclear protein, EBNA1, constitutively expressed during infection with EBV. Additionally,
these vectors can be used for transfection, where large amounts of protein can be generated
transiently in vitro. Herpesvirus amplicon systems are also being used to package pieces of DNA
> 220 kb and to infect cells that can stably maintain DNA as episomes.

Other useful systems include, for example, replicating and host-restricted non-replicating
vaccinia virus vectors.
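
As a non-limiting illustration, the approximate payload sizes mentioned in this section (about 8 kb for retroviral vectors, about 4 to 5 kb for AAV type vectors, and more than 150 kb for HSV and EBV based vectors) can be used to shortlist vector classes for an insert of a given size. The sketch below is in Python; the function name, the dictionary, and the choice to treat each quoted figure as a simple cutoff are assumptions made only for illustration and are not part of the disclosure.

```python
# Approximate insert capacities in kb, taken from the figures quoted in the text.
# Treating them as hard cutoffs is a simplifying assumption of this sketch.
APPROX_CAPACITY_KB = {
    "retroviral": 8.0,              # "about 8 kb" after removal of gag, pol, and env
    "AAV": 5.0,                     # "about 4 to 5 kb"; the upper end is used here
    "herpesvirus (HSV/EBV)": 150.0, # text states "> 150 kb"; 150 is a conservative figure
}


def candidate_vectors(insert_kb):
    """Return the vector classes whose approximate capacity accommodates the insert."""
    if insert_kb <= 0:
        raise ValueError("insert_kb must be positive")
    return sorted(
        name for name, capacity in APPROX_CAPACITY_KB.items() if insert_kb <= capacity
    )


print(candidate_vectors(4.5))   # small insert: all three classes qualify
print(candidate_vectors(30.0))  # larger insert: only the large payload class qualifies
```

Capacity is only one consideration; as noted elsewhere in this section, target cell type and whether the cells are dividing also bear on the choice of vector.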

(2) Non-nucleic acid based systems 
The disclosed compositions can be delivered to the target cells in a variety of ways. For
example, the compositions can be delivered through electroporation, or through lipofection, or
through calcium phosphate precipitation. The delivery mechanism chosen will depend in part on
the type of cell targeted and whether the delivery is occurring, for example, in vivo or in vitro.

Thus, the compositions can comprise, in addition to the disclosed nucleic acids and
proteins or vectors, for example, lipids such as liposomes, such as cationic liposomes (e.g.,
DOTMA, DOPE, DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins
to facilitate targeting a particular cell, if desired. A composition comprising a
compound and a cationic liposome can be administered to the blood afferent to a target organ or
inhaled into the respiratory tract to target cells of the respiratory tract. Regarding liposomes, see,
e.g., Brigham et al. Am. J. Resp. Cell. Mol. Biol. 1:95-100 (1989); Felgner et al. Proc. Natl.
Acad. Sci. USA 84:7413-7417 (1987); U.S. Pat. No. 4,897,355. Furthermore, the compound can
be administered as a component of a microcapsule that can be targeted to specific cell types, such
as macrophages, or where the diffusion of the compound or delivery of the compound from the
microcapsule is designed for a specific rate or dosage.

In the methods described above which include the administration and uptake of exogenous
DNA into the cells of a subject (i.e., gene transduction or transfection), delivery of the
compositions to cells can be via a variety of mechanisms. As one example, delivery can be via a
liposome, using commercially available liposome preparations such as LIPOFECTIN,
LIPOFECTAMINE (GIBCO-BRL, Inc., Gaithersburg, MD), SUPERFECT (Qiagen, Inc. Hilden,
Germany) and TRANSFECTAM (Promega Biotec, Inc., Madison, WI), as well as other liposomes
developed according to procedures standard in the art. In addition, the nucleic acid or vector of
this invention can be delivered in vivo by electroporation, the technology for which is available
from Genetronics, Inc. (San Diego, CA) as well as by means of a SONOPORATION machine
(ImaRx Pharmaceutical Corp., Tucson, AZ).

The materials may be in solution or suspension (for example, incorporated into
microparticles, liposomes, or cells). These may be targeted to a particular cell type via antibodies,
receptors, or receptor ligands. The following references are examples of the use of this technology
to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem., 2:447-451, (1991);
Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J. Cancer, 58:700-703,
(1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al., Cancer Immunol.
Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunolog. Reviews, 129:57-80,
(1992); and Roffler, et al., Biochem. Pharmacol., 42:2062-2065, (1991)). These techniques can be
used for a variety of other specific cell types. Vehicles, among others, include "stealth" and other
antibody conjugated liposomes (including lipid mediated drug targeting to colonic carcinoma),
receptor mediated targeting of DNA through cell specific ligands, lymphocyte directed tumor
targeting, and highly specific therapeutic retroviral targeting of murine glioma cells in vivo. The
following references are examples of the use of this technology to target specific proteins to tumor
tissue (Hughes et al., Cancer Research, 49:6214-6220, (1989); and Litzinger and Huang,
Biochimica et Biophysica Acta, 1104:179-187, (1992)). In general, receptors are involved in
pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in
clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome
in which the receptors are sorted, and then either recycle to the cell surface, become stored
intracellularly, or are degraded in lysosomes. The internalization pathways serve a variety of
functions, such as nutrient uptake, removal of activated proteins, clearance of macromolecules,
opportunistic entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level
regulation. Many receptors follow more than one intracellular pathway, depending on the cell type,
receptor concentration, type of ligand, ligand valency, and ligand concentration. Molecular and
cellular mechanisms of receptor-mediated endocytosis have been reviewed (Brown and Greene, DNA
and Cell Biology 10:6, 399-409 (1991)).

Nucleic acids that are delivered to cells which are to be integrated into the host cell
genome typically contain integration sequences. These sequences are often viral related
sequences, particularly when viral based systems are used. These viral integration systems can
also be incorporated into nucleic acids which are to be delivered using a non-nucleic acid based
system of delivery, such as a liposome, so that the nucleic acid contained in the delivery system can
become integrated into the host genome.

Other general techniques for integration into the host genome include, for example,
systems designed to promote homologous recombination with the host genome. These systems
typically rely on sequence flanking the nucleic acid to be expressed that has enough homology
with a target sequence within the host cell genome that recombination between the vector nucleic
acid and the target nucleic acid takes place, causing the delivered nucleic acid to be integrated into
the host genome. These systems and the methods necessary to promote homologous
recombination are known to those of skill in the art.

(3) In vivo/ex vivo 

As described above, the compositions can be administered in a pharmaceutically
acceptable carrier and can be delivered to the subject's cells in vivo and/or ex vivo by a variety of
mechanisms well known in the art (e.g., uptake of naked DNA, liposome fusion, intramuscular
injection of DNA via a gene gun, endocytosis and the like).

If ex vivo methods are employed, cells or tissues can be removed and maintained outside
the body according to standard protocols well known in the art. The compositions can be
introduced into the cells via any gene transfer mechanism, such as, for example, calcium
phosphate mediated gene delivery, electroporation, microinjection or proteoliposomes. The
transduced cells can then be infused (e.g., in a pharmaceutically acceptable carrier) or
homotopically transplanted back into the subject per standard methods for the cell or tissue type.
Standard methods are known for transplantation or infusion of various cells into a subject.

It is understood that in certain embodiments, constructs which produce an integrin signal
transduction inhibitor are driven by inducible promoters, rather than constitutive promoters.
Inducible systems provide certain advantages to the expression of the disclosed constructs. Any
inducible system can be used. Also disclosed are cells containing the inducible systems described
herein and in the Examples. These cells can be used as model systems in a wide variety of
assays, as well as in vivo settings.

g) Expression systems

The nucleic acids that are delivered to cells typically contain expression controlling
systems. For example, the inserted genes in viral and retroviral systems usually contain promoters
and/or enhancers to help control the expression of the desired gene product. A promoter is
generally a sequence or sequences of DNA that function when in a relatively fixed location in
regard to the transcription start site. A promoter contains core elements required for basic
interaction of RNA polymerase and transcription factors, and may contain upstream elements and
response elements.

(1) Viral Promoters and Enhancers 

Preferred promoters controlling transcription from vectors in mammalian host cells may
be obtained from various sources, for example, the genomes of viruses such as polyoma, Simian
Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus,
or from heterologous mammalian promoters, e.g. beta actin promoter. The early and late
promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also
contains the SV40 viral origin of replication (Fiers et al., Nature, 273: 113 (1978)). The
immediate early promoter of the human cytomegalovirus is conveniently obtained as a HindIII E
restriction fragment (Greenway, P.J. et al., Gene 18: 355-360 (1982)). Of course, promoters
from the host cell or related species also are useful herein.

Enhancer generally refers to a sequence of DNA that functions at no fixed distance from
the transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl. Acad. Sci. 78:
993 (1981)) or 3' (Lusky, M.L., et al., Mol. Cell Bio. 3: 1108 (1983)) to the transcription unit.
Furthermore, enhancers can be within an intron (Banerji, J.L. et al., Cell 33: 729 (1983)) as well
as within the coding sequence itself (Osborne, T.F., et al., Mol. Cell Bio. 4: 1293 (1984)). They
are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to
increase transcription from nearby promoters. Enhancers also often contain response elements that
mediate the regulation of transcription. Promoters can also contain response elements that mediate
the regulation of transcription. Enhancers often determine the regulation of expression of a gene.
While many enhancer sequences are now known from mammalian genes (globin, elastase,
albumin, α-fetoprotein and insulin), typically one will use an enhancer from a eukaryotic cell virus
for general expression. Preferred examples are the SV40 enhancer on the late side of the
replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma
enhancer on the late side of the replication origin, and adenovirus enhancers.

The promoter and/or enhancer may be specifically activated either by light or specific 
chemical events which trigger their function. Systems can be regulated by reagents such as 
tetracycline and dexamethasone. There are also ways to enhance viral vector gene expression by 
exposure to irradiation, such as gamma irradiation, or alkylating chemotherapy drugs. 

In certain embodiments the promoter and/or enhancer region can act as a constitutive
promoter and/or enhancer to maximize expression of the region of the transcription unit to be
transcribed. In certain constructs the promoter and/or enhancer region can be active in all eukaryotic
cell types, even if it is only expressed in a particular type of cell at a particular time. A preferred
promoter of this type is the CMV promoter (650 bases). Other preferred promoters are SV40
promoters, cytomegalovirus (full length promoter), and retroviral vector LTR.

It has been shown that all specific regulatory elements can be cloned and used to construct 
expression vectors that are selectively expressed in specific cell types such as melanoma cells. 
The glial fibrillary acidic protein (GFAP) promoter has been used to selectively express genes in
cells of glial origin. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human
or nucleated cells) may also contain sequences necessary for the termination of transcription which
may affect mRNA expression. These regions are transcribed as polyadenylated segments in the
untranslated portion of the mRNA encoding tissue factor protein. The 3' untranslated regions also
include transcription termination sites. It is preferred that the transcription unit also contain a
polyadenylation region. One benefit of this region is that it increases the likelihood that the
transcribed unit will be processed and transported like mRNA. The identification and use of
polyadenylation signals in expression constructs is well established. It is preferred that
homologous polyadenylation signals be used in the transgene constructs. In certain transcription
units, the polyadenylation region is derived from the SV40 early polyadenylation signal and
consists of about 400 bases. It is also preferred that the transcribed units contain other standard
sequences, alone or in combination with the above sequences, to improve expression from, or
stability of, the construct.

(2) Markers 

The viral vectors can include nucleic acid sequence encoding a marker product. This
marker product is used to determine if the gene has been delivered to the cell and once delivered is
being expressed. Preferred marker genes are the E. coli lacZ gene, which encodes
β-galactosidase, and green fluorescent protein.

In some embodiments the marker may be a selectable marker. Examples of suitable
selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine kinase,
neomycin, neomycin analog G418, hygromycin, and puromycin. When such selectable markers
are successfully transferred into a mammalian host cell, the transformed mammalian host cell can
survive if placed under selective pressure. There are two widely used distinct categories of
selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell
line which lacks the ability to grow independent of a supplemented media. Two examples are
CHO DHFR- cells and mouse LTK- cells. These cells lack the ability to grow without the
addition of such nutrients as thymidine or hypoxanthine. Because these cells lack certain genes
necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing
nucleotides are provided in a supplemented media. An alternative to supplementing the media is
to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their
growth requirements. Individual cells which were not transformed with the DHFR or TK gene
will not be capable of survival in non-supplemented media.

The second category is dominant selection, which refers to a selection scheme used in any
cell type and does not require the use of a mutant cell line. These schemes typically use a drug to
arrest growth of a host cell. Those cells which have a novel gene would express a protein
conveying drug resistance and would survive the selection. Examples of such dominant selection
use the drugs neomycin (Southern, P. and Berg, P., J. Molec. Appl. Genet. 1: 327 (1982)),
mycophenolic acid (Mulligan, R.C. and Berg, P., Science 209: 1422 (1980)) or hygromycin
(Sugden, B. et al., Mol. Cell. Biol. 5: 410-413 (1985)). The three examples employ bacterial
genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin
(geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others include the neomycin
analog G418 and puromycin.

h) Peptides 

(1) Protein variants 

As discussed herein there are numerous variants of the beta4 integrin protein, alpha6
integrin protein, and gamma2 laminin5 protein, for example, that are known and herein
contemplated. In addition to the known functional homologue variants, there are derivatives of the
beta4, alpha6, and gamma2, and other disclosed proteins which also function in the disclosed
methods and compositions. Protein variants and derivatives are well understood to those of skill in
the art and can involve amino acid sequence modifications. For example, amino acid sequence
modifications typically fall into one or more of three classes: substitutional, insertional or
deletional variants. Insertions include amino and/or carboxyl terminal fusions as well as
intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be
smaller insertions than those of amino or carboxyl terminal fusions, for example, on the order of
one to four residues. Immunogenic fusion protein derivatives are made by fusing a polypeptide
sufficiently large to confer immunogenicity to the target sequence by cross-linking in vitro or by
recombinant cell culture transformed with DNA encoding the fusion. Deletions are characterized
by the removal of one or more amino acid residues from the protein sequence. Typically, no more
than about from 2 to 6 residues are deleted at any one site within the protein molecule. These
variants ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA encoding
the protein, thereby producing DNA encoding the variant, and thereafter expressing the DNA in
recombinant cell culture. Techniques for making substitution mutations at predetermined sites in
DNA having a known sequence are well known, for example M13 primer mutagenesis and PCR
mutagenesis. Amino acid substitutions are typically of single residues, but can occur at a number
of different locations at once; insertions usually will be on the order of about from 1 to 10 amino
acid residues; and deletions will range about from 1 to 30 residues. Deletions or insertions
preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues.
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final
construct. The mutations must not place the sequence out of reading frame and preferably will not
create complementary regions that could produce secondary mRNA structure. Substitutional
variants are those in which at least one residue has been removed and a different residue inserted
in its place. Such substitutions generally are made in accordance with the following Tables 1 and
2 and are referred to as conservative substitutions.
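
As a non-limiting illustration of the reading-frame requirement stated above, a single insertion or deletion in the coding DNA leaves the downstream codons in frame only when its length in nucleotides is a multiple of three, which is always the case when whole residues are inserted or deleted. The Python sketch below is illustrative only; the function names are hypothetical and not part of the disclosure.

```python
CODON_LENGTH_NT = 3  # one amino acid residue is encoded by three nucleotides


def residues_to_nucleotides(n_residues):
    """Convert a number of amino acid residues to the corresponding coding length in nucleotides."""
    if n_residues < 0:
        raise ValueError("n_residues must be non-negative")
    return n_residues * CODON_LENGTH_NT


def indel_preserves_frame(length_nt):
    """Return True if a single insertion or deletion of length_nt nucleotides keeps the reading frame."""
    if length_nt <= 0:
        raise ValueError("length_nt must be positive")
    return length_nt % CODON_LENGTH_NT == 0


# A deletion of 2 residues corresponds to 6 nucleotides and stays in frame.
print(indel_preserves_frame(residues_to_nucleotides(2)))
# A 4-nucleotide deletion would shift the reading frame.
print(indel_preserves_frame(4))
```

The check considers one insertion or deletion at a time; it does not assess secondary mRNA structure, which the text treats as a separate preference.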



TABLE 1: Amino Acid Abbreviations

Amino Acid           Abbreviations
alanine              Ala      A
alloisoleucine       AIle
arginine             Arg      R
asparagine           Asn      N
aspartic acid        Asp      D
cysteine             Cys      C
glutamic acid        Glu      E
glutamine            Gln      Q
glycine              Gly      G
histidine            His      H
isoleucine           Ile      I
leucine              Leu      L
lysine               Lys      K
phenylalanine        Phe      F
proline              Pro      P
pyroglutamic acid    pGlu
serine               Ser      S
threonine            Thr      T
tyrosine             Tyr      Y
tryptophan           Trp      W
valine               Val      V


TABLE 2: 


Amino 


Acid Substitutions 




Original Residue 


Exemplary Conservat 




Ala 






ser 




Arg 






lys, cln 




Asn 






gin; his 


10 


Asp 






glu 




Cys 






ser 




Gin 






asn, lys 




Glu 






asp 




Gly 






pro 


15 


His 






asn;gln 




He 






leu; val 




Leu 






ile; val 




Lys 






arg; gin; 




Met 






Leu; ile 


20 


Phe 






met; leu; tyr 




Ser 






thr 




Thr 






ser 




Trp 






tyr 




Tyr 






trp; phe 


25 


Val 






ile; leu 
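As an illustration only, Table 2 can be expressed as a lookup that reports whether a proposed substitution is among the exemplary conservative substitutions listed. The names CONSERVATIVE_SUBSTITUTIONS and is_conservative are hypothetical and are not part of the disclosure; the sketch assumes residues are given in three-letter form.

```python
# Table 2 as a lookup: original residue -> exemplary conservative substitutions.
# The dictionary and function names are illustrative assumptions.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Ser"},
    "Arg": {"Lys", "Gln"},
    "Asn": {"Gln", "His"},
    "Asp": {"Glu"},
    "Cys": {"Ser"},
    "Gln": {"Asn", "Lys"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if Table 2 lists `replacement` for `original`."""
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, set())
```

For instance, is_conservative("Val", "Ile") is true, whereas a Gly to Trp change is not listed in Table 2 and would be treated as less conservative.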



Substantial changes in function or immunological identity are made by selecting
substitutions that are less conservative than those in Table 2, i.e., selecting residues that differ
more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in
the area of the substitution, for example as a sheet or helical conformation, (b) the charge or
hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The substitutions
which in general are expected to produce the greatest changes in the protein properties will be
those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a
hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline
is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g.,
lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or
aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by)
one not having a side chain, e.g., glycine, in this case, (e) by increasing the number of sites for
sulfation and/or glycosylation.

For example, the replacement of one amino acid residue with another that is biologically
and/or chemically similar is known to those skilled in the art as a conservative substitution. For
example, a conservative substitution would be replacing one hydrophobic residue with another, or
one polar residue with another. The substitutions include combinations such as, for example, Gly,
Ala; Val, Ile, Leu; Asp, Glu; Asn, Gln; Ser, Thr; Lys, Arg; and Phe, Tyr. Such conservatively
substituted variations of each explicitly disclosed sequence are included within the mosaic
polypeptides provided herein.

Substitutional or deletional mutagenesis can be employed to insert sites for N-glycosylation
(Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other
labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, e.g.
Arg, are accomplished for example by deleting one of the basic residues or substituting one with
glutaminyl or histidyl residues.
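By way of a hedged sketch, candidate N-glycosylation sites of the Asn-X-Thr/Ser form can be located in a one-letter protein sequence with a regular expression. The function name find_n_glycosylation_sites and the exclude_proline option are assumptions introduced for illustration; the option reflects the commonly applied rule that X is any residue other than proline, which the text above does not itself state.

```python
import re


def find_n_glycosylation_sites(protein: str, exclude_proline: bool = False) -> list[int]:
    """Return 1-based positions of Asn residues in Asn-X-Thr/Ser motifs.

    `protein` is a one-letter amino acid string. With exclude_proline=True,
    motifs in which X is proline are skipped (an assumption, see lead-in).
    """
    middle = "[^P]" if exclude_proline else "."
    # A lookahead is used so that overlapping motifs are all reported.
    pattern = re.compile(rf"N(?={middle}[ST])")
    return [m.start() + 1 for m in pattern.finditer(protein.upper())]
```

A substitution that introduces Asn two residues upstream of an existing Ser or Thr would add a new position to the returned list, which is one way to check a designed variant.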

Certain post-translational derivatizations are the result of the action of recombinant host
cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently
post-translationally deamidated to the corresponding glutamyl and aspartyl residues. Alternatively,
these residues are deamidated under mildly acidic conditions. Other post-translational
modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of
seryl or threonyl residues, methylation of the α-amino groups of lysine, arginine, and histidine side
chains (T.E. Creighton, Proteins: Structure and Molecular Properties, W. H. Freeman & Co.,
San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in some instances,
amidation of the C-terminal carboxyl.

It is understood that one way to define the variants and derivatives of the disclosed
proteins herein is through defining the variants and derivatives in terms of homology/identity to
specific known sequences. For example, SEQ ID NO:5 sets forth a particular
sequence of beta4 integrin cDNA and SEQ ID NO:6 sets forth a particular sequence of a beta4
integrin protein. Specifically disclosed are variants of these and other proteins herein disclosed
which have at least 70% or 75% or 80% or 85% or 90% or 95% homology to the stated sequence.
Those of skill in the art readily understand how to determine the homology of two proteins. For
example, the homology can be calculated after aligning the two sequences so that the homology is
at its highest level.

Another way of calculating homology can be performed by published algorithms. Optimal
alignment of sequences for comparison may be conducted by the local homology algorithm of
Smith and Waterman, Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of
Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity method of
Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin
Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by
inspection.
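The calculation described above, in which two sequences are first aligned so that their similarity is at its highest level and the percentage of matching positions is then computed, can be sketched as follows. This is a minimal global alignment in the style of Needleman and Wunsch with a simple linear gap penalty. The scoring values, the function names, and the choice to divide by the alignment length are all assumptions made for illustration; the packaged programs named above use more elaborate scoring.

```python
def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Globally align a and b; return the two gapped strings ('-' marks a gap)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    # Fill the dynamic programming matrix.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score[i][j] = max(diag, score[i - 1][j] + gap, score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Percent of aligned columns holding identical residues (assumed definition)."""
    aligned_a, aligned_b = needleman_wunsch(a, b)
    if not aligned_a:
        return 0.0
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)
```

Under this sketch a variant would meet a threshold such as 70% when percent_identity(reference, variant) >= 70.0. Other definitions of the denominator (for example, the length of the shorter sequence) are also in use and give different values.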

The same types of homology can be obtained for nucleic acids by for example the
algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci.
USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are herein
incorporated by reference for at least material related to nucleic acid alignment.

It is understood that the description of conservative mutations and homology can be 
combined together in any combination, such as embodiments that have at least 70% homology to a 
particular sequence wherein the variants are conservative mutations. 

As this specification discusses various proteins and protein sequences it is understood that
the nucleic acids that can encode those protein sequences are also disclosed. This would include
all degenerate sequences related to a specific protein sequence, i.e. all nucleic acids having a
sequence that encodes one particular protein sequence as well as all nucleic acids, including
degenerate nucleic acids, encoding the disclosed variants and derivatives of the protein sequences.
Thus, while each particular nucleic acid sequence may not be written out herein, it is understood
that each and every sequence is in fact disclosed and described herein through the disclosed
protein sequence. For example, one of the many nucleic acid sequences that can encode the
protein sequence set forth in SEQ ID NO:6 is set forth in SEQ ID NO:5. Another nucleic acid
sequence that encodes the same protein sequence set forth in SEQ ID NO:6 is set forth in SEQ ID
NO:16. In addition, for example, a disclosed conservative derivative of SEQ ID NO:2 is shown in
SEQ ID NO:17, where the valine (V) at position 34 is changed to an isoleucine (I). It is understood
that for this mutation all of the nucleic acid sequences that encode this particular derivative of the
beta4 integrin are also disclosed including for example SEQ ID NO:18 and SEQ ID NO:19 which
set forth two of the degenerate nucleic acid sequences that encode the particular polypeptide set
forth in SEQ ID NO:17. It is also understood that while no amino acid sequence indicates what
particular DNA sequence encodes that protein within an organism, where particular variants of a
disclosed protein are disclosed herein, the known nucleic acid sequence that encodes that protein
in the particular organism from which that protein arises is also known and herein disclosed and
described.
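To illustrate what is meant by degenerate nucleic acid sequences, the following sketch counts and enumerates the DNA coding sequences that encode a short peptide under the standard genetic code. The peptide used in the usage note is an arbitrary example and is not one of the disclosed sequences. The names CODON_TABLE, CODONS_FOR, count_encodings and encodings are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Invert the table: amino acid (one-letter) -> list of synonymous codons.
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode `peptide` (one-letter code)."""
    total = 1
    for aa in peptide:
        total *= len(CODONS_FOR[aa])
    return total


def encodings(peptide: str):
    """Yield every DNA sequence encoding `peptide`; practical only for short peptides."""
    for combo in product(*(CODONS_FOR[aa] for aa in peptide)):
        yield "".join(combo)
```

For the tripeptide "MVI", count_encodings returns 12 (one codon for Met, four for Val, three for Ile), and each of those 12 sequences is a degenerate encoding of the same peptide. The count grows multiplicatively with length, which is why a full-length protein has far too many encodings to write out individually.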

i) Pharmaceutical carriers/Delivery of pharmaceutical products

As described above, the compositions, for example, compositions that inhibit alpha6
function, beta4 function, or gamma2 function can also be administered in vivo in a
pharmaceutically acceptable carrier. By "pharmaceutically acceptable" is meant a material that is
not biologically or otherwise undesirable, i.e., the material may be administered to a subject, along
with the nucleic acid or vector, without causing any undesirable biological effects or interacting in
a deleterious manner with any of the other components of the pharmaceutical composition in
which it is contained. The carrier would naturally be selected to minimize any degradation of the
active ingredient and to minimize any adverse side effects in the subject, as would be well known
to one of skill in the art.

The compositions may be administered orally, parenterally (e.g., intravenously), by
intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally, topically or
the like, although topical intranasal administration or administration by inhalant is typically
preferred. As used herein, "topical intranasal administration" means delivery of the compositions
into the nose and nasal passages through one or both of the nares and can comprise delivery by a
spraying mechanism or droplet mechanism, or through aerosolization of the nucleic acid or vector.
The latter may be effective when a large number of animals is to be treated simultaneously.
Administration of the compositions by inhalant can be through the nose or mouth via delivery by a
spraying or droplet mechanism. Delivery can also be directly to any area of the respiratory system
(e.g., lungs) via intubation. The exact amount of the compositions required will vary from subject
to subject, depending on the species, age, weight and general condition of the subject, the severity
of the allergic disorder being treated, the particular nucleic acid or vector used, its mode of
administration and the like. Thus, it is not possible to specify an exact amount for every
composition. However, an appropriate amount can be determined by one of ordinary skill in the
art using only routine experimentation given the teachings herein.

Parenteral administration of the composition, if used, is generally characterized by
injection. Injectables can be prepared in conventional forms, either as liquid solutions or
suspensions, solid forms suitable for solution or suspension in liquid prior to injection, or as
emulsions. A more recently revised approach for parenteral administration involves use of a slow
release or sustained release system such that a constant dosage is maintained. See, e.g., U.S.
Patent No. 3,610,795, which is incorporated by reference herein.

The materials may be in solution or suspension (for example, incorporated into
microparticles, liposomes, or cells). These may be targeted to a particular cell type via antibodies,
receptors, or receptor ligands. The following references are examples of the use of this technology
to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem., 2:447-451, (1991);
Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J. Cancer, 58:700-703,
(1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al., Cancer Immunol.
Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunology Reviews, 129:57-80,
(1992); and Roffler, et al., Biochem. Pharmacol., 42:2062-2065, (1991)). Vehicles include
"stealth" and other antibody conjugated liposomes (including lipid mediated drug targeting to
colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands, lymphocyte
directed tumor targeting, and highly specific therapeutic retroviral targeting of murine glioma cells
in vivo. The following references are examples of the use of this technology to target specific
proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214-6220, (1989); and Litzinger and
Huang, Biochimica et Biophysica Acta, 1104:179-187, (1992)). In general, receptors are involved
in pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in
clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified endosome


in which the receptors are sorted, and then either recycle to the cell surface, become stored
intracellularly, or are degraded in lysosomes. The internalization pathways serve a variety of
functions, such as nutrient uptake, removal of activated proteins, clearance of macromolecules,
opportunistic entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level
regulation. Many receptors follow more than one intracellular pathway, depending on the cell
type, receptor concentration, type of ligand, ligand valency, and ligand concentration. Molecular
and cellular mechanisms of receptor-mediated endocytosis have been reviewed (Brown and Greene,
DNA and Cell Biology 10:6, 399-409 (1991)).

(1) Pharmaceutically Acceptable Carriers
The compositions, including antibodies, can be used therapeutically in combination with a
pharmaceutically acceptable carrier.

Pharmaceutical carriers are known to those skilled in the art. These most typically would
be standard carriers for administration of drugs to humans, including solutions such as sterile
water, saline, and buffered solutions at physiological pH. The compositions can be administered
intramuscularly or subcutaneously. Other compounds will be administered according to standard
procedures used by those skilled in the art.

Pharmaceutical compositions may include carriers, thickeners, diluents, buffers,
preservatives, surface active agents and the like in addition to the molecule of choice.
Pharmaceutical compositions may also include one or more active ingredients such as antimicrobial
agents, anti-inflammatory agents, anesthetics, and the like.

The pharmaceutical composition may be administered in a number of ways depending on
whether local or systemic treatment is desired, and on the area to be treated. Administration may be
topical (including ophthalmic, vaginal, rectal, or intranasal), oral, by inhalation, or
parenteral, for example by intravenous drip, subcutaneous, intraperitoneal or intramuscular
injection. The disclosed antibodies can be administered intravenously, intraperitoneally,
intramuscularly, subcutaneously, intracavity, or transdermally.

Preparations for parenteral administration include sterile aqueous or non-aqueous
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol,
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions,
including saline and buffered media. Parenteral vehicles include sodium chloride solution,
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous
vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on
Ringer's dextrose), and the like. Preservatives and other additives may also be present such as, for
example, antimicrobials, antioxidants, chelating agents, and inert gases and the like.

Formulations for topical administration may include ointments, lotions, creams, gels, drops, 
suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder 
or oily bases, thickeners and the like may be necessary or desirable. 

Compositions for oral administration include powders or granules, suspensions or solutions
in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings, diluents,
emulsifiers, dispersing aids or binders may be desirable.

Some of the compositions may potentially be administered as a pharmaceutically
acceptable acid- or base-addition salt, formed by reaction with inorganic acids such as
hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid, and
phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic acid,
lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric acid, or
by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide, potassium
hydroxide, and organic bases such as mono-, di-, trialkyl and aryl amines and substituted
ethanolamines.

(2) Therapeutic Uses
The dosage ranges for the administration of the compositions are those large enough to
produce the desired effect in which the symptoms of the disorder are affected. The dosage should not be
so large as to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions,
and the like. Generally, the dosage will vary with the age, condition, sex and extent of the disease
in the patient and can be determined by one of skill in the art. The dosage can be adjusted by the
individual physician in the event of any contraindications. Dosage can vary, and can be
administered in one or more dose administrations daily, for one or several days.

Other compositions which do not have a specific pharmaceutical function, but which may
be used for tracking changes within cellular chromosomes or for the delivery of diagnostic tools
for example can be delivered in ways similar to those described for the pharmaceutical products.

j) Chips and micro arrays
Disclosed are chips where at least one address is the sequences or part of the sequences set
forth in any of the nucleic acid sequences disclosed herein. Also disclosed are chips where at least
one address is the sequences or portion of sequences set forth in any of the peptide sequences
disclosed herein.

Also disclosed are chips where at least one address is a variant of the sequences or part of
the sequences set forth in any of the nucleic acid sequences disclosed herein. Also disclosed are
chips where at least one address is a variant of the sequences or portion of sequences set forth in
any of the peptide sequences disclosed herein.

k) Computer readable mediums 

It is understood that the disclosed nucleic acids and proteins can be represented as a
sequence consisting of the nucleotides or amino acids. There are a variety of ways to display these
sequences, for example the nucleotide guanosine can be represented by G or g. Likewise the
amino acid valine can be represented by Val or V. Those of skill in the art understand how to
display and express any nucleic acid or protein sequence in any of the variety of ways that exist,
each of which is considered herein disclosed. Specifically contemplated herein is the display of
these sequences on computer readable mediums, such as commercially available floppy disks,
tapes, chips, hard drives, compact disks, and video disks, or other computer readable mediums.
Also disclosed are the binary code representations of the disclosed sequences. Those of skill in the
art understand what computer readable mediums are. Thus, disclosed are computer readable mediums
on which the nucleic acids or protein sequences are recorded, stored, or saved.
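One possible binary code representation of a nucleic acid sequence, given here only as a sketch, packs each nucleotide into two bits. The particular bit assignment, the four-byte length header, and the function names pack and unpack are assumptions chosen for illustration; many other encodings would serve equally well. Upper and lower case letters (for example G or g) are treated as the same nucleotide.

```python
# Assumed 2-bit code for the four DNA nucleotides.
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
LETTER = {value: base for base, value in CODE.items()}


def pack(seq: str) -> bytes:
    """Pack a DNA string into bytes: 4-byte big-endian length, then 2 bits per base."""
    seq = seq.upper()
    out = bytearray(len(seq).to_bytes(4, "big"))
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        # Left-align a short final chunk so decoding can read fixed bit positions.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack(data: bytes) -> str:
    """Recover the DNA string written by pack()."""
    length = int.from_bytes(data[:4], "big")
    bases = []
    for byte in data[4:]:
        for shift in (6, 4, 2, 0):
            bases.append(LETTER[(byte >> shift) & 0b11])
    return "".join(bases[:length])
```

The length header is needed because the final byte may be padded; without it, trailing padding bits would decode as extra A residues.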

Disclosed are computer readable mediums comprising the sequences and information regarding
the sequences set forth herein. Also disclosed are computer readable mediums comprising the
sequences and information regarding the sequences set forth herein wherein the sequences do not
include SEQ ID NOs:1-19.

l) Kits

Disclosed herein are kits that are drawn to reagents that can be used in practicing the
methods disclosed herein. The kits can include any reagent or combination of reagents discussed
herein or that would be understood to be required or beneficial in the practice of the disclosed
methods. For example, the kits could include primers to perform the amplification reactions
discussed in certain embodiments of the methods, as well as the buffers and enzymes required to
use the primers as intended. For example, disclosed is a kit for assessing a subject's risk for
acquiring cancer, such as colon cancer, comprising the primers or probes that hybridize to the
sequences set forth in SEQ ID NOs:1, 3, 5, 7, 9, 11, and 13, for example.

m) Compositions with similar functions
It is understood that the compositions disclosed herein have certain functions, such as
inhibiting gamma2 function or binding alpha6 integrin or inhibiting beta4 function. Disclosed
herein are certain structural requirements for performing the disclosed functions, and it is
understood that there are a variety of structures which can perform the same function which are
related to the disclosed structures, and that these structures will ultimately achieve the same result,
for example stimulation or inhibition of alpha6beta4 signaling or interruption of the alpha6beta4
signaling pathway.

2. Methods of making the compositions

The compositions disclosed herein and the compositions necessary to perform the 
disclosed methods can be made using any method known to those of skill in the art for that 
particular reagent or compound unless otherwise specifically noted. 

a) Nucleic acid synthesis

For example, the nucleic acids, such as the oligonucleotides to be used as primers, can be
made using standard chemical synthesis methods or can be produced using enzymatic methods or
any other known method. Such methods can range from standard enzymatic digestion followed by
nucleotide fragment isolation (see for example, Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Edition (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989)
Chapters 5, 6) to purely synthetic methods, for example, by the cyanoethyl phosphoramidite
method using a Milligen or Beckman System 1Plus DNA synthesizer (for example, Model 8700
automated synthesizer of Milligen-Biosearch, Burlington, MA or ABI Model 380B). Synthetic
methods useful for making oligonucleotides are also described by Ikuta et al., Ann. Rev. Biochem.
53:323-356 (1984), (phosphotriester and phosphite-triester methods), and Narang et al., Methods
Enzymol., 65:610-620 (1980), (phosphotriester method). Protein nucleic acid molecules can be
made using known methods such as those described by Nielsen et al., Bioconjug. Chem. 5:3-7
(1994).

b) Peptide synthesis

One method of producing the disclosed proteins, or fragments of the disclosed proteins,
such as a fragment of SEQ ID NO:6, is to link two or more peptides or polypeptides together by
protein chemistry techniques. For example, peptides or polypeptides can be chemically
synthesized using currently available laboratory equipment using either Fmoc
(9-fluorenylmethyloxycarbonyl) or Boc (tert-butyloxycarbonyl) chemistry (Applied Biosystems,
Inc., Foster City, CA). One skilled in the art can readily appreciate that a peptide or polypeptide
corresponding to the disclosed proteins, for example, can be synthesized by standard chemical
reactions. For example, a peptide or polypeptide can be synthesized and not cleaved from its
synthesis resin whereas the other fragment of a peptide or protein can be synthesized and
subsequently cleaved from the resin, thereby exposing a terminal group which is functionally
blocked on the other fragment. By peptide condensation reactions, these two fragments can be
covalently joined via a peptide bond at their carboxyl and amino termini, respectively, to form an
antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. W.H.
Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles of Peptide
Synthesis. Springer-Verlag Inc., NY (which is herein incorporated by reference at least for
material related to peptide synthesis)). Alternatively, the peptide or polypeptide is independently
synthesized in vivo as described herein. Once isolated, these independent peptides or polypeptides

may be linked to form a peptide or fragment thereof via similar peptide condensation reactions. 

For example, enzymatic ligation of cloned or synthetic peptide segments allows relatively
short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole
protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, native
chemical ligation of synthetic peptides can be utilized to synthetically construct large peptides or
polypeptides from shorter peptide fragments. This method consists of a two step chemical reaction
(Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)).
The first step is the chemoselective reaction of an unprotected synthetic peptide-thioester with
another unprotected peptide segment containing an amino-terminal Cys residue to give a
thioester-linked intermediate as the initial covalent product. Without a change in the reaction conditions,
this intermediate undergoes spontaneous, rapid intramolecular reaction to form a native peptide
bond at the ligation site (Baggiolini M et al. (1992) FEBS Lett. 307:97-101; Clark-Lewis I et al.,
J. Biol. Chem., 269:16075 (1994); Clark-Lewis I et al., Biochemistry, 30:3128 (1991); Rajarathnam
K et al., Biochemistry 33:6623-30 (1994)).

Alternatively, unprotected peptide segments are chemically linked where the bond formed
between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide)
bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been used to synthesize
analogs of protein domains as well as large amounts of relatively pure proteins with full biological
activity (deLisle Milton RC et al., Techniques in Protein Chemistry IV. Academic Press, New
York, pp. 257-267 (1992)).

The disclosed proteins and polypeptides such as that for SEQ ID NO: 6, beta4 integrin, can be 
made using any traditional recombinant biotechnology method. Examples of such methods can be 
found in Sambrook et al. which is herein incorporated by reference at least for material related to 
production of proteins and antibodies. 

c) Processes for making the compositions

Disclosed are processes for making the compositions as well as making the intermediates
leading to the compositions. For example, disclosed are nucleic acids in SEQ ID NOs:1, 3, 5, 7, 9,
11, and 13. There are a variety of methods that can be used for making these compositions, such
as synthetic chemical methods and standard molecular biology methods. It is understood that the
methods of making these and the other disclosed compositions are specifically disclosed.

Disclosed are nucleic acid molecules produced by the process comprising linking in an
operative way a nucleic acid molecule comprising the sequence set forth in SEQ ID NOs:1, 3, 5, 7,
9, 11, or 13 or a fragment thereof, and a sequence controlling the expression of the nucleic acid.

Also disclosed are nucleic acid molecules produced by the process comprising linking in
an operative way a nucleic acid molecule comprising a sequence having at least 80% identity to a
sequence set forth in SEQ ID NOs:1, 3, 5, 7, 9, 11, or 13 or a fragment thereof, and a sequence
controlling the expression of the nucleic acid.

Disclosed are nucleic acid molecules produced by the process comprising linking in an
operative way a nucleic acid molecule comprising a sequence that hybridizes under stringent
hybridization conditions to a nucleic acid molecule comprising the sequence set forth in SEQ ID
NOs:1, 3, 5, 7, 9, 11, or 13 or a fragment thereof, and a sequence controlling the expression of the
nucleic acid.

Disclosed are nucleic acid molecules produced by the process comprising linking in an
operative way a nucleic acid molecule comprising a sequence encoding a peptide set forth in SEQ
ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, and a sequence controlling an expression of
the nucleic acid molecule.

Disclosed are nucleic acid molecules produced by the process comprising linking in an
operative way a nucleic acid molecule comprising a sequence encoding a peptide having 80%
identity to a peptide set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, and a
sequence controlling an expression of the nucleic acid molecule.

Disclosed are nucleic acids produced by the process comprising linking in an operative
way a nucleic acid molecule comprising a sequence encoding a peptide having 80% identity to a
peptide set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, wherein any changes
from the sequences set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof are
conservative changes, and a sequence controlling an expression of the nucleic acid molecule.

Disclosed are cells produced by the process of transforming the cell with any of the 
disclosed nucleic acids. Disclosed are cells produced by the process of transforming the cell with 
any of the non-naturally occurring disclosed nucleic acids. 

Disclosed are any of the disclosed peptides produced by the process of expressing any of
the disclosed nucleic acids. Disclosed are any of the non-naturally occurring disclosed peptides
produced by the process of expressing any of the disclosed nucleic acids. Disclosed are any of the
disclosed peptides produced by the process of expressing any of the non-naturally occurring disclosed
nucleic acids.

Disclosed are animals produced by the process of transfecting a cell within the animal with
any of the nucleic acid molecules disclosed herein. Disclosed are animals produced by the process
of transfecting a cell within the animal with any of the nucleic acid molecules disclosed herein, wherein
the animal is a mammal. Also disclosed are animals produced by the process of transfecting a cell
within the animal with any of the nucleic acid molecules disclosed herein, wherein the mammal is a
mouse, rat, rabbit, cow, sheep, pig, or primate.

Also disclosed are animals produced by the process of adding to the animal any of the cells
disclosed herein.

d) Products produced from selection protocols
Also disclosed are methods of obtaining molecules that act as functional regulators of
integrin function, integrin receptor function, and functional regulators of signaling pathways
related to integrin receptors, in particular integrin alpha6beta4.

Disclosed are methods for isolating molecules that interact with the proteins set forth in
SEQ ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof, comprising interacting a library of
molecules with the proteins set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof,
removing the unbound molecules, and collecting the molecules that are bound to at least one of the
proteins set forth in SEQ ID NOs:2, 4, 6, 8, 10, 12, or 14 or a fragment thereof.

3. Methods of using the compositions 

a) Methods of using the compositions as research tools 

The compositions can be used for example as targets in combinatorial chemistry protocols
or other screening protocols to isolate molecules that possess desired functional properties related
to A6B4 and A6B1 signaling pathways.

The disclosed compositions can also be used as diagnostic tools related to diseases such as
cancer, for example colon cancer.

The disclosed compositions can be used as discussed herein as either reagents in micro
arrays or as reagents to probe or analyze existing micro arrays. The disclosed compositions can be
used in any known method for isolating or identifying single nucleotide polymorphisms. The
compositions can also be used in any method for allelic analysis of, for example, beta4
alleles having varying function, particularly allelic analysis as it relates to beta4 signaling and
functions. The compositions can also be used in any known method of screening assays related to
chip/micro arrays. The compositions can also be used in any known way of using the computer
readable embodiments of the disclosed compositions, for example, to study relatedness or to
perform molecular modeling analysis related to the disclosed compositions.

b) Methods for affecting cancer 

The disclosed compositions can be used to affect the growth of cancer cells because, as
disclosed herein, the disclosed relationships are fundamental to the ability of cancer cells to
continue growing. The disclosed compositions, such as antisense constructs that inhibit the
production of alpha6, beta4, or the gamma2 chain of laminin5, reduce the proliferation of
cancer cells. In fact, the compositions can not only reduce the proliferation of the cancer cells,
but can also kill the cancer cells, as shown herein.

It is understood that cancer is caused by a variety of cellular events, of which certain
events related to alpha6 integrin (upregulated), beta4 integrin (upregulated) and gamma2
(upregulated) allow the continued viability of cancer cells, and that interference with these
upregulated molecules inhibits the growth of, and kills, the cancer cells. However, there are other
known events that can cause non-cancerous cells to become oncogenic. For example, Abl, Ras,
EGF receptor, ErbB-2, APC, beta-catenin, Arf, Mdm2, p53, Rb, and Myc are known to be involved in
oncogenesis, and some of these molecules (for example, Ras, APC loss, p53 loss) are directly
related to the disclosed target signal transduction pathways mediated by the alpha6beta4 receptor.
Just as the presently disclosed compositions can be used as therapeutics targeting the disclosed
relationships, so too there are other targets (for example, Abl, ErbB-2) for which pharmaceutical
compositions have been developed (for example, Glivec, Herceptin, respectively).

It is understood that the disclosed anti-cancer compositions can be used in combination
with other anti-cancer compositions. Great benefits can be obtained from using anti-cancer
compositions that target different molecules in the same signal transduction pathway as well as,
or in addition to, molecules in signal transduction pathways other than those disclosed
herein. Thus, the disclosed compositions, which can affect the growth of cancer cells and indeed kill
cancer cells, can be used in conjunction with any other chemotherapy, radiation, or any other
anti-cancer therapy.

Disclosed are methods of reducing the proliferation of a cancer cell which comprises
inhibiting ligand binding to an integrin receptor on the cancer cell, wherein the integrin receptor
comprises an integrin.

Also disclosed are methods of reducing the proliferation of a cancer cell which comprises
reducing integrin-integrin interaction, integrin receptor clustering interaction, or
integrin-non-integrin protein interaction.

Also disclosed are methods of selectively reducing the proliferation of cancer cells which
comprises reducing the interaction of integrins with one another, the clustering of integrins, or
the interaction of integrins with other proteins associated with cancer cells.

Disclosed are methods of reducing the proliferation of a cancer cell which comprises
reducing the production of an integrin by the cancer cell.

Also disclosed are methods of reducing the proliferation of a cancer cell which comprises 
reducing the production of an integrin receptor ligand by the cancer cell. 

Disclosed are methods of reducing the proliferation of a cancer cell which comprises
interfering with an integrin signaling pathway.

Also disclosed are methods of selectively killing or reducing the proliferation of cancer 
cells which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein 
the integrin receptor comprises a B4 integrin. 

Disclosed are methods of selectively killing or reducing the proliferation of cancer cells
which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein the
ligand comprises a laminin5.

Also disclosed are methods of selectively killing or reducing the proliferation of cancer
cells which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein
the ligand comprises the gamma2 subunit of laminin5.

Disclosed are methods of selectively killing or reducing the proliferation of cancer cells 
which comprises inhibiting ligand binding to integrin receptors on the cancer cells, wherein the 
integrin receptor comprises an alpha6 integrin. 

Further disclosed are methods of selectively killing or reducing the proliferation of cancer
cells which comprises preventing integrin receptor subunits from interacting with one another,
preventing integrin clustering, or preventing integrin receptor subunits from interacting with other
proteins on cancer cells, wherein the integrin receptor comprises a B4 integrin.

Disclosed are methods of selectively killing or reducing the proliferation of cancer cells
which comprises reducing the production of laminin by the cancer cells, wherein the laminin
comprises laminin5, and/or any of the subunits of laminin5, such as gamma2.

Also disclosed is a method of selectively killing or reducing the proliferation of cancer
cells which comprises interfering with an integrin signaling pathway, wherein the integrin
signaling pathway comprises a B4 integrin or an alpha6 integrin or a beta1 integrin or a laminin or
a laminin5 or the gamma2 subunit of laminin5.

Also disclosed are methods, wherein the integrin receptor comprises integrin B4 and/or
wherein the integrin receptor comprises integrin A6, and/or wherein the ligand that binds to the
integrin receptor is laminin5, and/or wherein the integrin receptor is A6B4, and/or wherein the
ligand comprises laminin5, and/or wherein the ligand comprises the gamma2 subunit of laminin5,
and/or wherein inhibiting ligand binding to an integrin receptor does not occur by using an
antisense molecule to A6, and/or wherein inhibiting ligand binding to an integrin receptor
comprises contacting an A6 integrin with a composition that inhibits ligand binding, and/or wherein
inhibiting ligand binding to an integrin receptor comprises contacting a B4 integrin with a
composition that inhibits ligand binding, and/or wherein inhibiting ligand binding to an integrin
receptor comprises contacting a laminin5 with a composition that inhibits ligand binding, and/or
wherein inhibiting ligand binding to an integrin receptor comprises contacting a gamma2 subunit
with a composition that inhibits ligand binding, and/or wherein reducing integrin-integrin 
interaction, integrin receptor clustering interaction, or integrin-non-integrin protein interaction 
does not occur by using a B4-delta-cyt, and/or wherein reducing integrin-integrin interaction, 
integrin receptor clustering interaction, or integrin-non-integrin protein interaction does not occur
by using an antisense molecule to A6, and/or wherein reducing integrin-integrin interaction,
integrin receptor clustering interaction, or integrin-non-integrin protein interaction comprises
contacting an A6 integrin with a composition that inhibits an interaction between the B4 integrin
and another integrin or protein molecule, and/or wherein reducing integrin-integrin interaction,
integrin receptor clustering interaction, or integrin-non-integrin protein interaction comprises
contacting a B4 integrin with a composition that inhibits the interaction between the B4 integrin
and another integrin or protein molecule, and/or wherein reducing integrin-integrin interaction,
integrin receptor clustering interaction, or integrin-non-integrin protein interaction comprises
contacting a laminin5 with a composition that inhibits ligand binding, and/or wherein reducing
integrin-integrin interaction, integrin receptor clustering interaction, or integrin-non-integrin
protein interaction comprises contacting a gamma2 subunit with a composition that inhibits ligand
binding, and/or wherein the non-integrin protein comprises a growth factor receptor, and/or 
wherein the non-integrin protein comprises a hemidesmosome junction, and/or wherein the
non-integrin protein comprises an SH2 domain, and/or wherein the non-integrin protein comprises a Shc
protein, and/or wherein the non-integrin protein comprises an IRS-1 protein, and/or wherein the
non-integrin protein comprises an IRS-2 protein, and/or wherein reducing the production of an
integrin does not occur by using an antisense molecule to A6, and/or wherein the production of an 
integrin is reduced by inhibiting signaling leading to induction of expression of an integrin, and/or 
wherein reducing the production of an integrin comprises inhibiting alpha6 production, and/or
wherein inhibiting alpha6 production further comprises using antisense molecules to alpha6
mRNA, and/or wherein reducing the production of an integrin comprises inhibiting beta4
production, and/or wherein inhibiting beta4 production further comprises using antisense
molecules to beta4 mRNA, and/or wherein reducing the production of an integrin receptor ligand
comprises inhibiting gamma2 production, and/or wherein reducing the production of an integrin
receptor ligand comprises inhibiting laminin production, and/or wherein reducing the production
of an integrin receptor ligand comprises inhibiting laminin5 production, and/or wherein interfering
with an integrin signaling pathway does not occur by using a B4-delta-cyt, and/or wherein 
interfering with an integrin signaling pathway does not occur by using an antisense molecule to 
A6, and/or wherein interfering with an integrin signaling pathway comprises contacting an A6
integrin in the cell with a composition that inhibits ligand binding, and/or wherein interfering with
an integrin signaling pathway comprises contacting a B4 integrin with a composition that inhibits
ligand binding, and/or wherein interfering with an integrin signaling pathway comprises contacting
a laminin5 with a composition that inhibits ligand binding, and/or wherein interfering with an
integrin signaling pathway comprises contacting a gamma2 subunit with a composition that
inhibits ligand binding, and/or wherein interfering with an integrin signaling pathway comprises
contacting the cancer cell with a molecule that interferes with at least one of talin, paxillin,
vinculin, a CAS family protein, CRK, NCK, FAK, ILK, Src, Fyn, Shc, Grb-2, guanine nucleotide
exchange factors, SOS, DOCK 180, Vav, Syk, PI-3 kinase, AKT, Bad, Bid, Caspase 9, Cdc42,
PAK, Rac, Rho, Rho kinase, Ras, Caveolin, Tetraspan, receptor-type protein tyrosine
phosphatase, SHP-2, alpha-actinin, filamin, cytohesin, beta3-endonexin, ICAP-1, RACK-1, CIB,
actin, receptor tyrosine kinase, IRS-1 or IRS-2, and/or wherein interfering with an integrin
signaling pathway comprises contacting the cancer cell with an agent that interferes with
post-translational modification of integrins, and/or wherein the post-translational modification is
glycosylation or phosphorylation.

Also disclosed are methods, wherein the integrin comprises an A6 integrin, and/or wherein
the integrin comprises a B4 integrin, and/or further comprising reducing a laminin5-integrin
interaction, and/or further comprising reducing a laminin5 gamma2-integrin interaction, and/or
wherein the cancer cell comprises normal p53, and/or wherein the proliferation of the cancer cells 
is not dependent on AKT/PKB, and/or wherein reducing the proliferation of the cancer cells is 
selective, and/or wherein the cancer cell is not an MDA-MB-435 cell, and/or wherein the cancer
cell is not an HMT-3522 cell, and/or wherein the cancer cell is not an RKO colon carcinoma line,
and/or wherein the cancer cell does not express exogenous B4 integrin, and/or further comprising 
contacting the cancer cells with a small molecule, peptide, peptide mimetic, or oligonucleotide or 
synthetic analog thereof, and/or wherein the cancer cells are contacted with dominant-negative 
beta4 integrin, and/or wherein the cancer cell is contacted with an antisense molecule, and/or
wherein the antisense molecule is linked to a leader sequence which enables translocation across a
cell membrane, and/or wherein the leader sequence binds to a cell surface protein which facilitates
internalization, and/or wherein the leader sequence is TAT or antennapedia, or fragment thereof,
and/or wherein the antisense molecule is an alpha6 RNA antisense molecule, and/or wherein the
small molecule, peptide, peptide mimetic, or oligonucleotide or synthetic analog thereof is linked
to a carrier, and/or wherein the carrier is at least one of a lipidic carrier, charged carrier, retroviral
carrier, TAT or fragment thereof, antennapedia or fragment thereof, or polyethylene glycol, and/or
further comprising contacting the cancer cell with another agent which modulates cell signaling, a
chemotherapeutic drug, or an angiogenesis inhibitor, or treating the cancer cell with radiation, and/or wherein
reducing the proliferation of the cancer cell is in vitro, and/or wherein reducing the proliferation of the
cancer cell is in vivo, and/or wherein the cancer cell is selected from the group consisting of
melanoma, adenoma, lymphoma, myeloma, carcinoma, plasmacytoma, sarcoma, glioma, thymoma,
leukemia, skin cancer, retinal cancer, breast cancer, prostate cancer, colon cancer, esophageal
cancer, stomach cancer, pancreas cancer, brain tumors, lung cancer, ovarian cancer, cervical cancer,
hepatic cancer, gastrointestinal cancer, and head and neck cancer cells, and/or wherein the cancer
cell is killed, and/or wherein the cancer cell expresses a mutated Ras, and/or wherein the cancer
cell expresses a mutated Ras and a mutated p53, and/or wherein the cancer cell expresses a
mutated Ras and activates the AKT/PKB protein, and/or wherein the cancer cell expresses a 
mutated Ras, a mutated p53, and activates the AKT/PKB protein, and/or wherein the cancer cell 
expresses a mutated APC, and/or wherein the cancer cell expresses a mutated Ras and mutated
APC.

Disclosed are methods of reducing the proliferation of a cancer cell in a patient which
comprises administering to the patient a composition which inhibits ligand binding to an integrin
receptor on the cancer cell, wherein the integrin receptor comprises an integrin.

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which
comprises administering to the patient a composition which reduces integrin-integrin interaction,
integrin receptor clustering interaction, or integrin-non-integrin protein interaction.

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which 
comprises administering to the patient a composition which reduces the production of an integrin 
or laminin by the cancer cell. 

Further disclosed are methods of reducing the proliferation of a cancer cell in a patient
which comprises administering to the patient a composition which interferes with an integrin
signaling pathway.

Also disclosed are methods, wherein the reduction in cancer cell proliferation is selective,
and/or wherein the administering is local or systemic, and/or wherein the patient is additionally
administered an agent which modulates cell signaling, a chemotherapeutic drug, or an angiogenesis
inhibitor, or is treated with radiation, and/or wherein the additional agent is administered serially or
in combination, and/or wherein the local administering is direct application to cancer cells, and/or
wherein the direct application to cancer cells is performed during surgery, and/or wherein the
direct application to cancer cells is performed topically to cancerous tissue, and/or wherein the
systemic administration is by subcutaneous, intraperitoneal, intra-arterial, intravenous, or bolus
administration, or by application through a catheter or similar apparatus, and/or wherein the
systemic administration comprises a long-term release formulation, and/or wherein the systemic 
administration is by oral administration, and/or wherein the oral administration comprises 
administering a pill, capsule, tablet, liquid or suspension. 

Disclosed are methods of reducing the proliferation of a cancer cell in a patient which
comprises administering to a patient a vector comprising coding sequence for a protein or peptide
which inhibits ligand binding to integrin receptors on cancer cells wherein the coding sequence is 
under the control of a promoter which functions in mammalian cells. 

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide
which prevents integrin receptor subunits from interacting with one another, prevents integrin 
receptor clustering interaction, or prevents integrin receptor subunits from interacting with other 
proteins on cancer cells wherein the coding sequence is under the control of a promoter which 
functions in mammalian cells. 

Further disclosed are methods of reducing the proliferation of a cancer cell in a patient
which comprises administering to a patient a vector comprising coding sequence for a protein or
peptide which prevents integrin receptor subunits from interacting with one another, prevents
integrin receptor clustering interaction, or prevents integrin receptor subunits from interacting with
other proteins in cancer cells wherein the coding sequence is under the control of a promoter which
functions in mammalian cells.

Also disclosed are methods of reducing the proliferation of a cancer cell in a patient which
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which interferes with integrin subunit or laminin production wherein the coding sequence is under 
the control of a promoter which functions in mammalian cells. 

Further disclosed are methods of reducing the proliferation of a cancer cell in a patient
which comprises administering to a patient a vector comprising coding sequence for a protein or
peptide which interferes with an integrin signaling pathway of cells wherein the coding sequence is 
under the control of a promoter which functions in mammalian cells. 

Also disclosed are methods, wherein the reduction in cancer cell proliferation is selective,
and/or wherein the vector is administered directly to a cancer cell, and/or wherein the vector is
administered directly to a normal cell, and/or wherein the vector is packaged in a viral vector or
liposome, and/or wherein the vector is a retroviral vector, and/or wherein the vector is
administered systemically, and/or wherein the direct administration is by topical application,
and/or wherein the direct administration is performed during surgery, and/or wherein the patient
is an animal, such as a mammal, mouse, rabbit, primate, chimp, ape, gorilla, or human, and/or
wherein the cancer cells are selected from the group
consisting of melanoma, adenoma, lymphoma, myeloma, carcinoma, plasmacytoma, sarcoma,
glioma, thymoma, leukemia, skin cancer, retinal cancer, breast cancer, prostate cancer, colon cancer,
esophageal cancer, stomach cancer, pancreas cancer, brain tumors, lung cancer, ovarian cancer,
cervical cancer, hepatic cancer, gastrointestinal cancer, and head and neck cancer cells, and/or
wherein the patient is additionally administered at least one of another agent which modulates cell
signaling, a chemotherapeutic drug, or an angiogenesis inhibitor, or is treated with radiation, and/or
wherein the other agent which modulates cell signaling, chemotherapeutic drug, angiogenesis
inhibitor or radiation treatment is administered serially or in combination.

c) Methods of gene modification and gene disruption 
The disclosed compositions and methods can be used in targeted gene disruption and 
modification in any animal that can undergo these events. Gene modification and gene disruption 
refer to the methods, techniques, and compositions that surround the selective removal or
alteration of a gene or stretch of chromosome in an animal, such as a mammal, in a way that
propagates the modification through the germ line of the mammal. In general, a cell is 
transformed with a vector which is designed to homologously recombine with a region of a 
particular chromosome contained within the cell, as for example, described herein. This 
homologous recombination event can produce a chromosome which has exogenous DNA
introduced, for example in frame, with the surrounding DNA. This type of protocol allows for
very specific mutations, such as point mutations, to be introduced into the genome contained 
within the cell. Methods for performing this type of homologous recombination are disclosed 
herein. 

One of the preferred characteristics of performing homologous recombination in
mammalian cells is that the cells should be able to be cultured, because the desired recombination
events occur at a low frequency.

Once the cell is produced through the methods described herein, an animal can be 
produced from this cell through either stem cell technology or cloning technology. For example, if 
the cell into which the nucleic acid was transfected was a stem cell for the organism, then this cell,
after transfection and culturing, can be used to produce an organism which will contain the gene
modification or disruption in germ line cells, which can then in turn be used to produce another 
animal that possesses the gene modification or disruption in all of its cells. In other methods for 
production of an animal containing the gene modification or disruption in all of its cells, cloning 
technologies can be used. These technologies generally take the nucleus of the transfected cell and
either through fusion or replacement fuse the transfected nucleus with an oocyte which can then be
manipulated to produce an animal. The advantage of procedures that use cloning instead of ES 
technology is that cells other than ES cells can be transfected. For example, a fibroblast cell, 
which is very easy to culture can be used as the cell which is transfected and has a gene 
modification or disruption event take place, and then cells derived from this cell can be used to
clone a whole animal. Conditional knockouts can also be made which will conditionally delete
expression of the desired molecule, for example, an A6 or B4 integrin or the gamma2 chain of
laminin5.

d) Methods of diagnosing cancer 
Disclosed are methods of diagnosing cancer using the disclosed information and the disclosed
molecules. In particular, disclosed are methods of diagnosis that rely on the combined
upregulation of both the ligand and the cognate integrin receptor or cognate integrins. It is
understood that all of the methods of diagnosis disclosed herein can be used with any of the
disclosed compositions, but they can also be used in conjunction, where, for example, both the
ligand and the integrin receptor are monitored and correlated with the development of cancer.

Disclosed are methods of assessing a subject's risk of developing cancer comprising
determining the amount of A6 present in a target cell obtained from the subject, wherein a
determination of increased levels of A6 correlates with an increased risk of cancer. 

Disclosed are methods of assessing a subject's risk of acquiring cancer comprising 
determining the amount of B4 present in a target cell obtained from the subject, wherein a 
determination of increased levels of B4 correlates with an increased risk of cancer.

Disclosed are methods of assessing a subject's risk of acquiring cancer comprising 
determining the amount of laminin5 present in a target cell obtained from the subject, wherein a
determination of increased levels of laminin5 correlates with an increased risk of cancer.

Disclosed are methods of assessing a subject's risk of acquiring cancer comprising 
determining the amount of gamma2 subunit present in a target cell obtained from the subject,
wherein a determination of increased levels of gamma2 subunit correlates with an increased risk of
cancer. 

Also disclosed are methods, further comprising comparing the amount of A6 present to the
amount in a control cell, and wherein determining the amount of A6 present in the target cell
comprises assaying the amount of A6 mRNA in the cell, and/or wherein assaying the amount
of mRNA in the cell comprises hybridizing an A6 probe to a sample of the subject's mRNA, and/or
wherein assaying the amount of mRNA in the cell comprises hybridizing an A6 primer to a
sample of the subject's mRNA, and/or wherein assaying the amount of mRNA further
comprises performing a nucleic acid amplification reaction involving the primer, and/or wherein
the nucleic acid amplification reaction comprises reverse transcription, producing a cDNA, and/or
wherein the nucleic acid amplification reaction further comprises performing a polymerase chain
reaction on the cDNA, and/or wherein determining the amount of A6 present in the target cell
comprises assaying the amount of A6 protein in the cell, and/or further comprising comparing the
amount of B4 present to the amount in a control cell, and/or further comprising comparing the
amount of laminin5 present to the amount in a control cell, and/or further comprising comparing the
amount of gamma2 subunit present to the amount in a control cell.
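
As an illustration only, the comparison of a target cell to a control cell described above can be
sketched as a simple ratio calculation. The sketch below is not part of the disclosed methods. The
names ExpressionMeasurement, fold_change, flag_increased and combined_upregulation, the
arbitrary measurement units, and the two-fold cutoff are all hypothetical assumptions chosen for
the example; the disclosure does not specify a numerical threshold for "increased levels".

```python
from dataclasses import dataclass


@dataclass
class ExpressionMeasurement:
    """One marker measured in a target cell and in a control cell (hypothetical structure)."""
    marker: str           # e.g. "A6", "B4", "laminin5", "gamma2"
    target_level: float   # amount of mRNA or protein in the target cell (arbitrary units)
    control_level: float  # amount of mRNA or protein in the control cell (same units)


def fold_change(m: ExpressionMeasurement) -> float:
    """Return the target/control ratio for one marker."""
    if m.control_level <= 0:
        raise ValueError(f"control level for {m.marker} must be positive")
    return m.target_level / m.control_level


def flag_increased(measurements, threshold=2.0):
    """List markers whose ratio meets the cutoff (2.0 is an assumed, illustrative value)."""
    return [m.marker for m in measurements if fold_change(m) >= threshold]


def combined_upregulation(measurements,
                          receptor_markers=("A6", "B4"),
                          ligand_markers=("laminin5", "gamma2"),
                          threshold=2.0):
    """True when at least one receptor marker and at least one ligand marker are increased."""
    flagged = set(flag_increased(measurements, threshold))
    receptor_up = any(name in flagged for name in receptor_markers)
    ligand_up = any(name in flagged for name in ligand_markers)
    return receptor_up and ligand_up


if __name__ == "__main__":
    # Invented numbers, for demonstration only.
    example = [
        ExpressionMeasurement("A6", 8.0, 2.0),
        ExpressionMeasurement("B4", 3.0, 3.0),
        ExpressionMeasurement("gamma2", 5.0, 2.0),
    ]
    print(flag_increased(example))
    print(combined_upregulation(example))
```

The combined check mirrors the statement above that the ligand and the integrin receptor can be
monitored in conjunction; any cutoff actually used would have to be established experimentally.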

C. Examples

Throughout this application, various publications are referenced. The disclosures of these 
publications in their entireties are hereby incorporated by reference into this application in order to
more fully describe the state of the art to which this invention pertains. The references disclosed 
are also individually and specifically incorporated by reference herein for the material contained in 
them that is discussed in the sentence in which the reference is relied upon. 

It will be apparent to those skilled in the art that various modifications and variations can 
be made in the present invention without departing from the scope or spirit of the invention. Other
embodiments of the invention will be apparent to those skilled in the art from consideration of the
specification and practice of the invention disclosed herein. It is intended that the specification
and examples be considered as exemplary only, with a true scope and spirit of the invention being 
indicated by the following claims. 

The following examples are put forth so as to provide those of ordinary skill in the art with
a complete disclosure and description of how the compounds, compositions, articles, devices
and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary of
the invention and are not intended to limit the scope of what the inventors regard as their
invention. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts,
temperature, etc.), but some errors and deviations should be accounted for. Unless indicated
otherwise, parts are parts by weight, temperature is in °C or is at ambient temperature, and 
pressure is at or near atmospheric. 

1. Example 1 Colonic epithelial cells dependent on alpha6beta4 receptor 
signal transduction for growth in the absence of ECM 

Integrin alpha6 and integrin beta4 and laminin gamma2 chains are essential for tumor cell
survival in vivo. The data disclosed herein indicate that interfering with integrin alpha6-mediated
signaling, integrin beta4 signaling, and laminin gamma2 signaling can constitute an effective
approach to induce programmed cell death in cancer cells without damaging normal cells.
Ablation of integrin alpha6-dependent signaling, integrin beta4-dependent signaling, and laminin
gamma2-dependent signaling in human cancer cell lines supports this.

The murine colonic epithelial transformation system introduced by D'Abaco et al. (1996) 
was used. In this system, control cells (control), cells containing an activated ras oncogene (Ras), 
a deletion in the APC gene (APCmin) or both alterations together (Ras+APCmin) can be compared
with regard to their proliferation characteristics in tissue culture. The cells were derived from
transgenic mice containing a temperature-sensitive allele of SV40 large T under the control of a
gamma interferon-inducible promoter permitting conditional immortalization (Jat et al., 1991).
All experiments were carried out with the cells being kept at non-permissive temperature in the
absence of gamma interferon. As shown previously (D'Abaco et al., 1996), only the cells carrying
activated Ras and the APCmin mutation were able to form colonies in soft agar in the absence of 
5 anchorage to a substratum. All other cell populations did not give rise to colonies under these 
conditions. 

The frequency of TUNEL-staining cells (Fig. 2A) and caspase 3 activity (Fig. 2B) in control,
Ras, APCmin and Ras+APCmin colonic epithelial cells (D'Abaco et al., 1996) in suspension were
measured. Both assays show a strong suppression of apoptosis in Ras+APCmin cells, indicating
that Ras and APCmin co-operate in preventing cell death in the absence of apparent ECM contacts.
Activated Ras can protect cells from apoptosis via activation of the serine-threonine kinase AKT
(Kauffmann-Zeh et al., 1997; Khwaja et al., 1997). However, both Ras and Ras+APCmin cells
show equivalent levels of AKT phosphorylation (not shown), eliminating AKT as the relevant
target of Ras+APCmin cooperation.

Although cancer cells have been thought to survive independently of integrin signaling,
they frequently express high levels of alpha3 or alpha6 integrin receptors and/or their ligands
(Dedhar et al., 1993; Kennel et al., 1989; Koshikawa et al., 1999; Lohi et al., 2000; Nejjari et al.,
1999; Van Waes and Carey, 1992). These integrins are suspected to play a role in invasion and
metastasis (Mukhopadhyay et al., 1999; Shaw et al., 1997). It was therefore examined whether Ras
and APCmin mutations would induce alterations in the mRNA expression patterns of integrin and
ECM components. Induction of alpha6 integrin alone was specific to Ras+APCmin cells (Fig. 3A).
In addition, we found elevated expression of the laminin alpha5, gamma2 and beta2 chains induced
in Ras and Ras+APCmin cells (Fig. 3B). These chains are components of laminins 5, 10 and 11
(Malinda and Kleinman, 1996), which are ligands of integrin receptors alpha6/beta4 and
alpha3/beta1 (Kikkawa et al., 2000; Niessen et al., 1994). These results indicate that Ras and
APCmin mutations can cooperate to induce autocrine activation of integrin receptors.

Alpha6 integrin can form functional receptors together with either beta1 or beta4 integrins
(Clark and Brugge, 1995), leading to signaling through downstream effectors such as focal
adhesion kinase (FAK) or the SH2 domain adapter Shc (Giancotti and Ruoslahti, 1999).
Conversely, beta4 integrin only binds to alpha6 integrin (Clark and Brugge, 1995). In transformed
colonic epithelial cells alpha6 integrin functions in conjunction with integrin beta4 and is engaged
by the laminin gamma2 chain to activate Shc (see Fig. 4). Consistent with alpha6 integrin
induction, only Ras+APCmin cells can bind a laminin gamma2-specific peptide in an alpha6 and
beta4 integrin-specific manner. In contrast, an alpha3/beta1 and alpha6/beta1-specific peptide
derived from the laminin gamma1 chain only binds to control cells (Fig. 4A). In addition, only in
Ras+APCmin cells is Shc phosphorylated in response to clustering of the integrin alpha6/beta4
receptor by beta4-specific antibodies (Mainiero et al., 1995) and by the laminin gamma2-specific
peptide (Fig. 4B). In both cases p52Shc is the major phosphorylated form. Moreover, alpha6
integrin is required for Shc activation, as Shc phosphorylation is inhibited in response to
expression of a dominant-negative beta4 integrin mutant. This mutant lacks the cytoplasmic
signaling domain but selectively binds to alpha6 integrin to form a ligand-binding, yet
signaling-defective alpha6/beta4 integrin (Spinardi et al., 1993) (Fig. 4B).

Anti-sense RNAs specific for alpha6 integrin and gamma2 laminin, as well as dominant-negative
beta4 integrin, were expressed in Ras+APCmin, Ras, APCmin and control cells, and the results
indicated that integrin alpha6, integrin beta4 and laminin gamma2 expression are relevant for
the survival of Ras+APCmin cells. All three constructs efficiently inhibit the growth of
Ras+APCmin cells in soft agar (not shown) and when attached to plastic (Fig. 5A1). In contrast,
the proliferation of wt, Ras and APCmin cells is not affected by this treatment (Fig. 5A1).
Importantly, the colony formation of Ras+APCmin cells exposed to any of the three inhibitory
constructs can be rescued by co-expression of baculovirus p35, a potent inhibitor of caspase
activity and of apoptosis (Resnicoff et al., 1998) (Fig. 5A1). Similarly, co-expression of
exogenous integrin alpha6 mRNA efficiently rescues Ras+APCmin cell proliferation inhibited by
alpha6 anti-sense RNA (Fig. 5A1). This is mirrored by a rescue of alpha6 integrin expression on
the cell surface of alpha6 anti-sense RNA expressing cells (Fig. 5B). Moreover, ectopic alpha6
integrin expression also rescues apoptosis induced by dominant-negative beta4 integrin (Fig.
5A1). The latter is expressed at constant levels (Fig. 5B). As one would expect, the cells
expressing laminin gamma2 anti-sense RNA could not be rescued by integrin alpha6
overexpression, but showed significant rescue when plated on dishes coated with laminin gamma2
peptide (Fig. 5A1). Similarly, this peptide increases the survival of Ras+APCmin cells in
suspension, as indicated by lower caspase 3 activity (Fig. 2C). In summary, the survival of
Ras+APCmin cells depends on the expression of alpha6/beta4 integrin and the laminin gamma2 chain.
Conversely, the inhibition of these gene activities leads to selective killing of transformed cells.

Data also indicate that cell death due to lack of alpha6/beta4 integrin receptor signaling in
Ras+APCmin cells is independent of the death signaling pathway involving the induction of
mitochondrial damage and caspase 9 activity. Instead, the data indicate an involvement of the
Fas/TNF receptor/death domain protein/caspase 8 pathway (Kruidering and Evan, 2000) in the
control of tumor cell survival by alpha6 integrin-containing integrin receptors.

Control, APCmin and Ras cells, which lack alpha6/beta4 integrin receptor signaling
activity, show high levels of caspase 8 activity when kept in suspension, while at the same time
caspase 9 activity cannot be detected (Fig. 2D).
Although Ras+APCmin cells can be rescued from apoptosis by the caspase inhibitor
baculovirus p35 (Fig. 5A1, A2 and text above), cell death induced by expression of
dominant-negative beta4 integrin cannot be prevented by expression of the survival factor Bcl2
(see Fig. 5A2). Bcl2 binds to and neutralizes BH3-domain killer proteins that cause mitochondrial
damage, cytochrome C release and caspase 9 activation (Luo et al., 1998).

Ras cells, which lack alpha6/beta4 integrin receptor signaling activity and cannot survive in
the absence of ECM contacts, and Ras+APCmin cells, which depend on alpha6/beta4 integrin
receptor signaling for survival, show equivalent levels of AKT phosphorylation (data not shown).
This indicates that AKT does not serve as the key target for alpha6/beta4 integrin receptor
signaling. AKT has been described to promote cell survival via phosphorylation and inactivation
of Bad, a BH3-domain killer protein (Datta et al., 1999).

2. Example 2 Other cancer cells dependent on alpha6beta4 receptor signal 
transduction for growth in the absence of ECM 

The survival of highly transformed primary mouse embryo fibroblasts expressing Ras and
Myc oncoproteins in conjunction with a homozygous ARF null mutation (Kamijo et al., 1997)
depends on expression and function of alpha6 integrin (Fig. 5C), thus demonstrating that this
principle may also apply to transformed mesenchymal cells. Moreover, the human colon cancer
cell line SW480, carrying multiple oncogenic mutations such as activated Ras (Fujita et al., 1988),
amplified c-Myc (Cherif et al., 1988), a mutated APC allele (Munemitsu et al., 1995) and a p53
mutation (Abarzua et al., 1995), is effectively killed by anti-sense integrin alpha6 and laminin
gamma2 RNAs as well as dominant-negative beta4 integrin (Fig. 4D). Similarly, integrin alpha6
ablation even leads to apoptosis of Ras+APCmin cells in the presence of active SV40 large T (not
shown). Thus the survival of different types of highly transformed cells depends on alpha6
integrin expression, beta4 integrin expression and laminin 5 expression irrespective of the status of
major tumor suppressor genes, such as arf, p53 or rb, and for at least SW480 cells, murine colon
epithelial cells, and the Ras+APCmin cells, alpha6, beta4, and laminin 5/gamma2 are required. The
sensitivity of transformed cells to ablation of integrin signaling, for example, alpha6/beta4 integrin
receptor signaling, is thus quite remarkable in its apparent generality.

Oncogenic mutations co-operate to engage autocrine integrin signaling. Importantly, this
signaling mechanism, involving alpha6 integrin, beta4 integrin and the laminin gamma2 chain,
becomes an essential component of the survival mechanism in transformed colonic epithelial cells.
In addition, fibroblasts and human colon cancer cells also rely on integrin alpha6 for survival. In
contrast, normal and partially transformed cells that express alpha6 integrin at low levels do not
require this polypeptide for survival. Thus, integrin signaling inhibition can lead to selective
killing of cancer cells.

The laminin-integrin receptor signaling loop is also relevant to the survival of cells
transformed by other combinations of oncogenic lesions. Introduction of activated Ras together
with dominant-negative p53 (Lloyd et al., 1997) into murine colonic epithelial cells (D'Abaco et
al., 1996; see also above) via retroviral infection supports this. Four distinct polyclonal pools of
infected cells were derived by drug selection: 1) Control (bleo/neo) cells, infected with two
retroviruses carrying neomycin (neo) or bleomycin (bleo) drug resistance markers, respectively;
2) Ras cells, infected with a Ras/neo virus and a virus with the bleo marker; 3) DNp53 cells,
infected with a DNp53/bleo virus and a virus with the neo marker; and 4) Ras/DNp53 cells,
infected with a Ras/neo virus and a DNp53/bleo virus.
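
The four pools listed above are the 2 x 2 combinations of one neo-marked virus and one
bleo-marked virus. The following is only an illustrative sketch of that bookkeeping; the
dictionary names and the helper functions are hypothetical and are not part of the disclosed
methods.

```python
from itertools import product

# Hypothetical labels: each virus carries one drug marker and, optionally, one oncogenic insert.
NEO_VIRUSES = {"neo": None, "Ras/neo": "Ras"}          # neomycin-marked viruses
BLEO_VIRUSES = {"bleo": None, "DNp53/bleo": "DNp53"}   # bleomycin-marked viruses


def pool_name(neo_insert, bleo_insert):
    """Name a pool after the inserts it carries; no insert at all means the control pool."""
    inserts = [gene for gene in (neo_insert, bleo_insert) if gene]
    return "/".join(inserts) if inserts else "Control"


def build_pools():
    """Pair every neo-marked virus with every bleo-marked virus (double infection)."""
    pools = {}
    for (neo_virus, neo_insert), (bleo_virus, bleo_insert) in product(
        NEO_VIRUSES.items(), BLEO_VIRUSES.items()
    ):
        pools[pool_name(neo_insert, bleo_insert)] = (neo_virus, bleo_virus)
    return pools


if __name__ == "__main__":
    for name, viruses in build_pools().items():
        print(name, viruses)
```

Because every pool carries both drug markers, all four populations can be selected under the
same double-drug conditions, which is what makes the comparison between them controlled.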

As the colon epithelial cells contain temperature-sensitive SV40 large T under control of
the gamma interferon promoter, all cell populations were drug-selected at the permissive
temperature (33°C) in the presence of gamma interferon. All further experiments were carried out
at the non-permissive temperature for SV40 large T in the absence of gamma interferon, as
described above for Ras+APCmin cells.

Similar to Ras+APCmin cells, only Ras/DNp53 cells, but not bleo/neo, Ras/bleo or
DNp53/neo cells, were able to grow in the absence of ECM contact in soft agar (not shown).
Moreover, Ras and DNp53 cooperate in the suppression of caspase 3 activity in suspension (Fig.
2B), and in the induction of alpha6 integrin expression (Fig. 4A). In Ras/DNp53 cells passaged
through a single round of soft agar growth, alpha6 integrin is expressed at even higher levels,
demonstrating a correlation between the ability of the cells to survive in the absence of ECM
contact and alpha6 integrin expression. In Ras/DNp53 cells beta4 integrin expression levels are
also increased when compared to controls (Fig. 4A). As expected, expression of the laminin
gamma2 chain is induced in Ras/bleo and Ras/DNp53 cells (Fig. 4B).

Anti-sense RNAs specific for alpha6 integrin and gamma2 laminin, as well as dominant-negative
beta4 integrin, were expressed in Ras/DNp53, Ras/bleo, DNp53/neo and neo/bleo cells.
All three constructs efficiently inhibit the growth of Ras/DNp53 cells when attached to plastic. In
contrast, the proliferation of neo/bleo, Ras/bleo and DNp53/neo cells is not affected by this
treatment (Fig. 5A3). Anti-sense alpha6 integrin or dominant-negative beta4 integrin also inhibit
the growth of Ras/DNp53 cells in soft agar (Fig. 5A4).

Integrin alpha6 expression can also be induced by activated Ras and DNp53 in primary murine
colon crypt epithelial cells (not shown). Furthermore, introduction of Ras and Myc oncoproteins
into our control colon epithelial cells also leads to an induction of alpha6 expression (not shown),
suggesting that induction of alpha6 integrin expression may be an integral component of distinct
oncogene cooperation paradigms.

a) Cell types to be tested

As described herein, there are a variety of cancer and transformed cell types that express
alpha6 integrin and laminins at high levels (Dedhar et al., 1993; Koshikawa et al., 1999; Lohi et
al., 2000; Van Waes and Carey, 1992). As disclosed herein, the following cell lines other than the
fibroblasts require alpha6 and beta4 integrin expression, as well as laminin gamma2 chain
expression: (1) the human colon carcinoma cell line SW480 (data not shown), (2)
Ras+APCmin-transformed murine colon epithelial cells (see Fig. 3A). Murine fibroblasts
transformed by activated Ras and Myc in conjunction with homozygous Arf null mutation require
alpha6 expression for their survival. These cells do not express integrin beta4, but express beta1
integrin, indicating the importance of alpha6/beta1 receptors in the transformed fibroblasts.

b) Inducible expression of inhibitors

To interrupt alpha6 integrin and laminin gamma2 chain-dependent signaling in tumors,
regulatable alpha6 integrin and laminin gamma2 anti-sense mRNA expression as well as
regulatable expression of the dominant-negative beta4 integrin polypeptide described in
Section C were used. Constitutive expression of these inhibitors leads to rapid cell death in tissue
culture. In contrast, the establishment of clonal cell lines with inducible expression of anti-sense
mRNAs or dominant-negative beta4 integrin minimizes such limitations.

The doxycycline (dox)-inducible reverse tetracycline transactivator (rtTA) is frequently
used to overexpress transgenes in a temporally regulated fashion in vitro and in vivo (Efrat et al.,
1995; Gossen et al., 1995; Ray et al., 1997). These systems are, however, often compromised by
the basal levels of gene expression in the absence of dox administration. The tetracycline-controlled
transcriptional silencer (tTS), a fusion protein containing the tet repressor and the KRAB-AB
domain of the kid-1 transcriptional repressor, is inhibited by doxycycline. As shown in tissue
culture (Freundlieb et al., 1999) and in transgenic mice (Zhu et al., 2001), tTS tightens the control
of transgene expression in rtTA-based systems, i.e. tTS effectively eliminates leaky baseline
expression without altering the inducibility of rtTA-regulated genes. Thus optimal "off/on"
regulation of gene expression can be accomplished with the combined use of tTS and rtTA. The
complete expression system is commercially available from CLONTECH. This system can be
used for the preparation of all inducible cell lines.
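
The "off/on" behavior described above can be summarized as a simple qualitative truth table.
The sketch below is only an illustration under simplifying assumptions (each factor is treated
as fully present or absent, and expression is reduced to three labels); the function name and
the state labels are hypothetical and do not describe any quantitative property of the system.

```python
def transgene_state(dox_present: bool, has_rtta: bool = True, has_tts: bool = False) -> str:
    """Qualitative state of a tet-responsive transgene.

    Assumptions (illustrative only):
    - rtTA activates the promoter only when dox is present.
    - tTS silences the promoter only when dox is absent (dox inhibits tTS).
    - With neither activation nor silencing, baseline expression may be leaky.
    """
    activated = has_rtta and dox_present
    silenced = has_tts and not dox_present
    if activated:
        return "on"
    if silenced:
        return "off"
    return "leaky"


if __name__ == "__main__":
    for tts in (False, True):
        for dox in (False, True):
            print(f"tTS={tts!s:5} dox={dox!s:5} -> {transgene_state(dox, has_tts=tts)}")
```

In this simplified picture, adding tTS changes only the dox-free state, from possibly leaky to
silenced, while the dox-induced state is unchanged, which mirrors the statement that tTS removes
baseline expression without altering inducibility.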

Another regulatable system comprising the estrogen-dependent transactivator GalER-VP16
and a promoter under the control of Gal4 DNA binding sites (Braselmann et al., 1993) can
be used (Perez-Roger et al., 1997). This particular experimental setup was used herein to show
that the induction of the beta4 integrin dominant-negative mutant in SW480 colon cancer cells
induces apoptosis in vitro, as measured by caspase 3 activation (Fig. 2B). For in vivo use,
however, a point mutation has to be introduced into the GalER-VP16 transactivator that eliminates
its sensitivity to estrogen while retaining its response to the anti-estrogen 4OH-tamoxifen
(Littlewood et al., 1995). 4OH-tamoxifen has been demonstrated to reversibly regulate in vivo the
activity of another regulatable transactivator, the MycER chimera (Pelengaris et al., 1999).

Typically, to derive inducible cell lines, the anti-sense or dominant-negative constructs
coupled to the regulatable promoters are first stably transfected into the four test cell lines.
Subsequently, the activator and the repressor, such as rtTA and tTS (see above), are introduced
via infection using recombinant retroviruses with different selectable markers.

E. Sequences

1. SEQ ID NO:1 Human alpha6 integrin cds (acc# 4557674)

50 atggccgccgccgggcagctgtgctfcgcfccfcacctgfcccgcggggcfcccfcgfccGcggcfccggcgcagccfctcaactfcggac 
actcgggaggacaacgtgatccggaaafcatggagaccccgggagcctcttcggcttctcgctggcca^gcactggcaactg 
cagcccgaggacaagcggctgtfcgcfccgbgggggccccgcgcggagaagcgcfctccac tgcagagagccaacagaacggga 
gggctgbacagcfcgcgacatcaccgcccgggggccafcgcacgcggafccgagtbtgataacgafcgcfcgaccccacgfccagaa 
agcaaggaagabcagbggabgggggbcaccgbccagagccaaggbccagggggcaaggbcgbgacafcgfegcbcaccgaba t 

55 gaaaaaaggcagcatgtbaabacgaagcaggaabcccgagacabcbbbgggcggfcgttatgbccbgagfccagaafccfccagg 
attgaagacgatafcggabgggggagabbggagcbbb-bgbgatgggcgatbgagaggccatcagaaatttggcbcttgccag
caaggbgbagcagcbacbbbbacbaaagacbbfccabbacabbgbabbbggagccccgggbacbbabaacbggaaagggabfe 
gbbcgbgtagagcaaaagaataacacbtbbbbtgacatgaacatctttgaagabgggccbtabgaagbbggbggagagact 
gagcabgabgaaagbcbcgbbccbgbbccbgcfcaacagbtacbbaggtbbtbcbttggacbcagggaaaggbabbgbbbcb 
5 aaagabgagabcacttbbgbabcbggbgctcccagagccaabcacagtggagccgtggttttgcbgaagagagacabgaag 
tcbgcacabctccbcccbgagcacababtcgabggagaaggbcbggccbcbtcabbbggcbabgabgbggcggtggbggac 
cbcaacaaggatgggbggcaagababagbbabbggagccccacagbabbttgabagagabggagaagfebggaggfcgcagbg 
tatgbcbacabgaaccagcaaggcagabggaabaabgbgaagccaabbcgbcbbaabggaaccaaagabbcbabgbbbggc 
attgcagtaaaaaababbggagababbaabcaagabggcbacccagababbgcagbbggagcbccgbabgabgacbbggga 

10 aaggbbbbbabcbabcabggabcbgcaaatggaabaaabaccaaaccaacacaggbbcfccaagggbababcaccbbabbbb 
ggababtcaabbgcbggaaacabggaccbtgabcgaaabbccbacccbgabgtbgcbgttggb bccctcbcagatbcagfca 
acbatbfcbcagabcccggccbgbgabbaababbcagaaaaccabcacagbaacbccbaacagaatbgacctccgccagaaa 
acagcgbgtggggcgccbagtgggafcabgccbccaggtbaaabccbgbbbfcgaababacbgcbaaccccgcbggbbabaab 
ccbtcaatatcaattgtgggcacacttgaagctgaaaaagaaagaagaaaatctgggcbatccbcaagagbbcagfctbcga 

15 aaccaaggttctgagcccaaatabactcaagaacfcaacbcbgaagaggcagaaacagaaagbgbgcabggaggaaacccbg 
fcggcbacaggataababcagagabaaacbgcgbc^cabbcccabaacbgccbcagbggagabccaagagccaagcbcbcgb 
aggcgagbgaatbcacbbccagaagbbcb fcccaabbcbgaabbcagabgaacccaagacagcbcababbgatgtbcactbc 
bbaaaagagggabgbggagacgacaabgbabgbaacagcaacctbaaacbagaababaaabbb bgcacccgagaaggaaab 
caagacaaatttbcfcbafcbbaccaabbcaaaaaggbgtaccagaacbagbbcbaaaagabcagaaggatabbgctbbagaa 

20 abaacagbgacaaacagcccttccaacccaaggaabcccacaaaagatggcgatgacgcccatgaggcbaaac tgafcbgca 
acgtttccagacactfctaacctattctgcatahagagaactgagggr.fctbccctgagaaacagbhgaght-gtgbtgccaac 
cagaabggctcgcaagctgactgbgagctcggaaabcctbbtaaaagaaattcaaatgbcacbbbttattbggt btbaagb 
acaacbgaagfccacctbtgacaccccababcbggababbaabcbgaagb bagaaacaacaagcaabcaagabaabbfcggcb 
ccaabbacagcbaaagcaaaagbggbbabtgaactgcbtbbabcggbcbcgggagbbgctaaaccbtcccaggbgtabbbb 

25 ggaggbacagfcbgbbggcgagcaagcbabgaaabcbgaagabgaagbgggaagtbbaabagagbabgaabbcagggbaaba 
aacbbaggbaaaccbcbbacaaaccbcggcacagcaaccbbgaacabtcagbggccaaaagaaabtagcaatgggaaabgg 
b tgcbbbabbtggbgaaagtagaatccaaaggattggaaaaggbaacbtgtgagccacaaaaggagabaaacbccctgaac 
ctaacggagbctcacaacbcaagaaagaaacgggaaattactgaaaaacagatagatgataacagaaaatbttcfcttattt 
gctgaaagaaaataccagacbctbaacbgbagcgtgaacgbgaacbgbgbgaacabcagabgcccgcbgcgggggcbggac 

30 agcaaggcgbcbcbbattbbgcgcbcgaggbbatggaacagcacatbbcbagaggaabafctccaaacbgaacbacbtggac 
abbcbcabgcgagccbbcabbgabgbgacbgcbgctgccgaaaababcaggcbgccaaatgcaggcacbcaggbtcgagbg 
acbgbgbbbcccbcaaagacbgbagcbcagbabtcgggagbaccbtggbggabcatccbagtggcbabbcbcgcbgggatc 
fcbgabgcbbgcbbbabtagbgbbbabacbabggaagtgbggbbbcbbcaagagaaabaagaaagatcabbabgabgccaca 
babcacaaggctgagabccabccbcagccabcbgataaagagaggcbbacbtcbgafcgcatag 

2. SEQ ID NO:2 Human integrin alpha6 protein sequence (acc# NP_000201):

1 maaagq^cll ylsagllsrl gaafnldbre dnvirkygdp gslfg£slam hwqlqpedkr 
61 lllvgaprge alplqrar.rb gglyscdiba rgpcbriefd ndadpbsesk edqwmgvbvq 
121 sqgpggkwb cahryekrqh vnbkqesrdi fgrcyvlsqn Irieddmdgg dwsfcdgrlr 

40 181 ghekfgscqq gvaabfbkdf hyivfgapg^ ynwkgivrve qknntffdmn ifedgpyevg 
241 gebehdeslv pvpansylgf sldsgkgivs kdeifcfvsga pranhsgavv llkrdmksah 
301 llpehifdge glassfgydv awdlnkdgw qdivigapqy fdrdgevgga vyvymnqqgr 
361 wnnvkpirln gtkdsmfgia vknigdiaqd gypdiavgap yddlgkvfiy hgsanginbk 
421 ptqvlkgisp yfgysiagnm dldrnsypdv avgslsdsvb ifrsrpvini gkbibvbpnr 

45 4 81 idlrqkbacg apsgiclqvk scfeybanpa gynpsisivg bleaekerrk sglssrvqfr 
541 nqgsepkybq elblkrqkqk vcmee llwlq dnirdklrpi pibasveiqe yasrirvnal 
601 pevlpilnsd epkbahidvh flkegcgddn vcnsnlkley kfcbregnqd kfsylpiqkg 
661 vpelvlkdqk dialeibvbn spsnprnpbk dgddaheakl iabfpdblby sayrelrafp 
721 ekqlscvanq ngsqadcelg npfkrnsnvt £ylvlsbtev bfdbpyldin lklebbsnqd 

50 781 nl apibakak vvielllsvs gvakpsqvyf ggbvvgeqam ksedevgsli eyefrvinlg 
841 kpltnlgbab Iniqwpkeis ngkwllylvk veskglekvb cepqkeinsl nlbeshnsrk 
901 kreibekqid dnrkfslfae rkyqblncsv nvncvnircp Irgldskasl ilrsrlwnsfc 
961 fleeysklny Idilmrafid vbaaaenirl pnagbqvrvb vfpskbvaqy sgvpwwiilv 
1021 ailagilmla llvbilwkcg ffkrnkkdhy dabyhkaeih aqpsdkerlb sda 

3. SEQ ID NO:3 Mus musculus alpha6 integrin cds (acc# 7110658):

abggccgbcgcgggccagbbgbgccbgcbcbaccbgbccgcggggcbbcbagcccggcbgggbacagccbbcaaccbggac 
acccgcgaggacaacgtgabccggaaabcgggggatcccgggagcctcbbcggcbbcbcgcbcgccabgcacbggcagbfcg 
cagccggaggacaagcggcbgbbgctbgbgggggcaccbcgggcagaagcacbcccgcbgcagagggcgaacagaacaggg 
ggccbgbacagcbgbgacabcaccbcccgaggaccbfcgbacacggattgaabbtgabaabgacgcfcgabccbafcgtcagaa 

60 agcaaggaagaccagtggabgggagbcacbgbccagagccaaggbccagggggcaaagbggbgacgbgbgcacafccgabab 
gagaaacggcagcacgtcaacacgaagcaggagbcgcgggatabcbbbggaagabgbbafcgtccbgagbcagaabcbcaga 
abbgaagabgababggacggaggagacbggagbbbcbgcgabggccggbbgagaggccabgaaaagbtbggcbccbgbcag 
caaggagbagcggcbacbbtcactaaggacbttcattacatbgbtbbtggagccccagggacbtacaacbggaaagggabc 
gbccgtgbagaacaaaagaataacacb tfct fcbbgacabgaacatcbtbgaagabgggcccbatgaagbbggbggagagaca 

65 gatcatgabgaaagtctcgtgcccgttcctgctaacagttaccbaggcbtttcgctggacfccagggaagggbattgbbtcb 
aaagabgacabcacbbbbgbgtctggbgcbccaagagccaabcacagbggggcbgbagbgbbgcb&aaaagagacabgaag
tccgcacabctgc tcccbgagbabababb bgacggagaaggccbggcbtcctcgbbbggcbabgatgtggcagbggbggac 
cbcaabgcagabgggbggcaagacabcgb tabcggagctccacagbabbbbgabagggabggtgaagtcgggggtgcagbb 
tacgbctacabbaaccagcaaggcaaatggagtaatgtgaagccgabtcgtctaaabgggaccaaagactcgatgt ttgga 
5 atcbctgbgaaaaabataggtgatabbaaccaagabggcbatccagabatbgcbgbtggagctccctatgabgabcfcgggg 
aaggbbbbbabcbabcabggatccccgacbggcabaa t ~accaagccaacacaggbbcbcgaggggacabcgccbbacbbc 
ggcbabtcaabcgcbgggaatabggaccbggabcggaac tccbaccccgaccbbgcbgbgggcbcccbcbcagacbcggbc 
acbabbbbcagabcccggccagbgabbaacabbcbaaaaaccabcacagbgacbccbaacagaabbgaccbccgccagaag 
bccatgbgbggcbcaccbagcqggabatqccbcaaqqbbaaagccbgbbbbgaatabactgcgaaaccbbccggbbafcaac 

10 ccbccaababcaabtbbgggtattcbbgaagcbgaaaaagaaagaagaaaatcagggttgbcatcgagagttcagtttcga 
aaccaaggbtccgagccaaagtabacbcaggagcbgaccctgaabcggcagaagcagcgggcgtgcatggaggagaccctc 
tggctgcaggagaacabcagagacaagcbgcgbcccabccccabcacggcbbcbgtggagabccacgagcccacgtcbcgc 
c 99 c 999 t 9 a£C tcacbccccgaagbbcbbcccabccbgaabbcaaabgaagccaaaacggbccagacagabgbccactbc 
bbaaaggaaggabgbggagacgacaabgbcbgbaacagcaaccbtaagctagagtabaaabbbggbacccgagaaggaaat 

15 caagacaaabbcbcbtaccbbccaabbcaaaaaggcatcccagaabbagbccbaaaagabcagaaagababagcbctggaa 
abaacggbgaccaacagcccttcggabccaaggaabccccggaaagatggcgacgabgcccatgaagccaaactcabcgcc 
acgbbbccagacacbcbgacababbccgcbbacagagaactgagggctbbcccbgagaagcagcbgagctgbgbggccaac 
cagaabggcbcccaagccgactgbgagcbcggaaabcct- tcaagagaaat bccagbgt tact ttc tat ctgattbtaagt 
acaaccgaggtcacctbbgacaccacagabcbggababtaatcbgaagbbggaaacaacaagcaatcaggabaaabbggcb 

20 cc aabbacagcgaaggcaaaagbggbbabbgaatbgct t~ tabcccbcbccggagbcgcbaagcct bcgcaggbgbat tbb 
ggaggbacagbbgbbggbgagcaagcbabgaaabcbgaagatgaagbaggaagbbbaabagagbatcaabbbagggtgabb 
aacbbaggcaagccbcbtaaaaacctcggcacagcaaccbtgaatabacagbggcccaaggagabtagcaatggcaaabgg 
bbgcbbbabbbgabgaaagbbgaabccaaaggbbbggagcagabbgbbbgbgagccacacaabgaaabaaacbacctgaag 
cbgaaggagbcbcacaacbcaagaaagaaacgggaacb tcctgaaaaacagabagabgacagcaggaaabb t tcbbtabtb 

25 cctgaaagaaaataccagacbctcaactgcagcgtcaacgtcaggtgtgfcgaacatcaggtgcccactgcgagggc tgg&o 
acgaaggccbctcbcgbtcbgbgbbccaggb tgbggaacagcacabbbctagaggaabat bccaaacbgaacbacbbggac 
atbcbcgbgacggct bccatagabgtcaccgcbgcbgcbcagaababcaagcbcccbcacgcgggcactcaggbbcgagbg 
acggbgbbbcccbcaa agac t g t agcbcag b ab t caggag bagcbbggbgga b cabccbccbggcbgb b c b bgccggga 1 1 
cbgabgcbggcbcbatbagbgb t bbbacbgtggaagbgtggcbbcbbcaagagaaabaagaaagabcabbacgabgccacc 

30 tabcacaaggctgagabccabactcagccgbcbgataaagagaggcbbacbbccgabgcatag 

4. SEQ ID NO:4 Mus musculus alpha6 integrin protein sequence (acc# NP_032423)

1 mavagqlcll ylsagllarl gbafnldbre drrvirksgdp gslfgfslam hwqlqpedkr 
61 lllvgaprae alplqranrt gglyscdibs rgpctriefd ndadpmsesk edqwmgvbvq 

35 121 sqgpggkwb cahryekrqh vntkqesrdi fgrcyvlsqn Irieddmdgg dwsfcdgrlr 
181 ghekfgscqq gvaabftkdf hyivfgapgb ynwkgivrve qknnbf fdmn ifedgpyevg 
241 getdhdeslv pvpansylgf sldsgkgivs kddibEvsga pranhsgavv llkrdmksah 
3 01 llpeyifdge glassfgydv avvdlnadgw qdivigapqy fdrdgevgga vyvyinqqgk 
361 wsnvkpirln gbkdsmfgis vknigdinqd gypdiavgap yddlgkvfiy hgsptgiibk 

40 421 pbgvlegtsp yfgysiagnm dldrnsypdl avgslsdsvt ifrsrpvini Ikbibvbpnr 
481 idlrqksmcg spsgiclkvk acfeybakps gynppisilg ileaekerrk sglssrvqfr 
541 nqgsepkybq elblnrqkgr acmeeblwlq enirdklrpi pibasveiqe pbsirrrvnsl 
601 pevlpilr.sn eakbvqbdvh flkegcgddn ^cnsnlkley kfgtregnqd kfsylpiqkg 
561 ipelvlkdqk dialeibvbn spsdprnprk dgddaheakl iatfpdblby sayrelrafp 

45 721 ekqlscvanq ngsqadcelg npfkrnssvt fylilsbtev bfdttdldin lklettsnqd 
781 klapitakak wielllsls gvakpsqvyf ggbwgegam ksedevgsli eyefrvinlg 
841 kplknlgbab Iniqwpkeis ngkwllylmk veskglegiv cephneinyl klkeshnsrk 
901 krelpekqid dsrkfslfpe rkyqblncsv nvrcvnircp lrgldbkasl vlcsrlwnsb 
961 bleeysklny Idilvrasid vbaaaqnikl phagbqvrvt vfpskb^aqy sgvawwiill 

50 1021 avlagilmla llvfllwkcg ffkrnkkdhy dafcyhkaeih bqpsdkerlb sda 

5. SEQ ID NO:5 Human integrin beta4 subunit cds (acc# 6453379):

abggcagggccacgccccagcccatgggccaggcbgcbcctggcagccbbgabcagcgbcagccbcbcbgggaccbbgaac 
cgcbgcaagaaggccccagbgaagagcbgcacggagbgtgbccgtgbggabaaggacfcgcgccbactgcacagacgagabg 
bbcagggaccggcgcbgcaacacccaggcggagcbgcbggccgcgggcbgccagcgggagagcabcgbggbcatggagagc 

55 agcb tccaaatcacagaggagacccagabbgacaccacccbgcggcgcagccagabgtccccccaaggccbgcgggbccgb 
cbgcggcccggbgaggagcggcabb-tgagcbggaggbgbbbgagccacbggagagccGcgbggacGbgbacabcctcafcg 
gacb tcbccaacbccatgbccgatgabcbggacaaccbcaagaagabggggcagaaccbggcbcgggbccbgagccagc be 
accagcgacbacacbabfcggabbbggcaagbb tgbggacaaagbcagcgbcccgcagacggacabgaggccbgagaagcbg 
aaggagccbbggcccaacagbgacccccccbbcbccbbcaagaacgbcabcagccbgacagaagabgbggabgagbbccgg 

60 aabaaacbgcagggagagcggabcbcaggcaaccbggatgctcctgagggcggcttcgabgccatccbgcagacagctgtg 
bgcacgagggacabbggcbggcgcccggacagcacccaccbgcbggtcbbcbccaccgagbcagccfcfcccactabgaggct 
gatggcgccaacgbgcbggctggcatcatgagccgcaacgabgaacggbgccaccbggacaccacgggcaccfcacacccag 
bacaggacacaggactacccgbcggzgcccacccbggbgcgccbgcbcgccaagcacaacatcabccccabcbbbgcfcgbc 
accaacbacbccbafcagcbacbacgagaagcbbcacaccbafcbbcccbgbcbccbcacbgggggbgcbgcaggaggacbcg 

65 bccaacatcgbggagcbgcbggaggaggccbbcaabcggabccgcfcccaaccbggacabccgggcccbagacagcccccga 
ggccbbcggacagaggbcaccbccaagabgbbccagaagacgaggacbgggbccbbbcacabccggeggggggaagbgggb
abataccaggfcgcagcbgcgggccccbgagcacgbggabgggacgcacgbgtgccagctgccggaggaccagaagggcaac 
abccatcbgaaaccbbccbbcbccgacggccbcaagabggacgcgggcabcabcbgbgabgbgbgcaccbgcgagcbgcaa 
aaagaggbgcggfccagctcgcbgcagctbcaacggagacbbcgtgbgcggacagtgbgtgtgcagcgagggcbggagbggc 
5 cagaccbgcaacbgcbccaccggctcbcbgagbgacabbcagcccbgccbgcgggagggcgaggacaagccgbgcbccggc 
cgbggggagbgccagbgcgggcactgbgbgbgcbacggcgaaggccgcbacgagggbcagbbctgcgagbabgacaacb be 
cagbgbccccgcacbbccgggbbcctcfcgcaabgaccgaggacgcbgcbccabgggccagbgbgbgbgbgagccbggbbgg 
acaggcccaagcbgbgactgbccccrcagcaatgccaccbgcabcgacagcaabgggggcabcbgbaabggacgbggccac 
bgbgagbgbggccgcbgccacbgccaccagcagbcgcbcbacacggacaccabcbgcgagabcaacbacbcggcgbccacc 

1 0 cgggccbcbgcgaggacctacgcbccbgcgbgcagbgccaggcgbggggcaccggcgagaagaaggggcgcacgbgbgagg 
aabgcaacbbcaaggbcaagabggbggacgagcbtaagagaggcgaggaggtggbggbgcgcbgcbccbbccgggacgagg 
abgacgacbgcaccbacagcbacaccabggaaggbgacggcgcccctgggcccaacagcactgbccbggtgcacaagaaga 
agggacbgcccbccgggcbccbbcbggbggcbcabcccccbgcbccbccbccbccbgccgcbccbggccctgcbacbgcbg 
ctabgcbggaagtacbgbgccbgcbgcaaggcctgccbqgcacbbctcccgtgcbgcaaccgaggbeacabggbgggcb t b 

1 5 aaggaagaccacbacabgcbgcgggagaaccbgabggccbcbgaccacbbggacacgcccabgcbgcgcagcgggaaccbc 
aagggccgbgacgbggbccgcbggaaggbcaccaacaacabgcagcggccbggcbb bgccacbcabgccgccagcabcaac 
cccacagagcbggbgcccbacgggctgbccbtgcgccbggcccgccbbbgcaccgagaaccbgcbgaagGcbgacactcgg 
gagtgcgcccagcfcgcgccaggaggfcggaggagaaccbqaacgaggbcbacaggcagabcbccggbgbacacaagcbccag 
cagaccaagbtccggcagcagcccaa tgccgggaaaaagcaagaccacaGcabbgbggacacagbgcbgabggcgGCCcgc 

20 bcggccaagccggcccbgcbgaagcbbacagagaagcaggbggaacagagggcctbccacgacctcaaggbggcccccggc 
tacbacaccctcacbgcagaccaggacgcccggggcabggbggagbtccaggagggcgbggagctggbggacgbacgggbg 
ccccbcbbbatccggccbgaggabgacgacgagaagcagcbgcbggbggaggccabcgacgbgcccgcaggcacbgccacc 
ctcggcGgccgccbggbaaacabcaccabGabGaaggagcaagccagagacgfcggbgbccbtbgagGagccbgagtbcbcg 
gbcagccgcggggaccaggbggcccgcabcccbgbcabccggcgbgbccbggacggcgggaagfccccaggbcbccbaccgc 

25 acacaggabggcaccgcgcagggcaaccgggaGbacafcccGcgtggagggbgagcbgcbgbtGcagGcbggggaggccbgg 
aaagagcbgcaggbgaagcbccbggagcbgcaagaagbbgacbccctccbgcggggccgccaggbccgccgbbbccacgbc 
cage bcagcaacccbaagbbbggggcccaccbgggccacccccacbccaccaccabcab cab cagggacccagabgaacbg 
gaccggagcbncacgagbcagabgbbgbcabcacagccacccccbcacggcgaccbgggcgccccgcagaaccccaabgcb 
aaggccgcbgggbccaggaagabccabtbcaacbggcbgcccccbbcbggcaagccaabggggbacagggbaaagbacbgg 

30 atbcagggbgacbccgaabccgaagcccaccbgcbcgacagcaaggbgcccbcagbggagcbcaccaaccbgfcacccgbab 
bgcgaGbabgagabgaaggbgbgcgccbacggggcbcagggcgagggaGCGbacagGbcccbggbgbccbgGcgGacccac 
caggaagfcgcccagcgagccagggcgfccbggccbfccaabgbcgbcbccfcccacggbgacccagcbgagcbgggcbgagccg 
gcbgagaccaacggbgagabcacagcctacgaggbcbgcbabggccbggbcaacgabgacaaccgaccbabbgggcccabg 
aagaaagbgcbggbbgacaacccbaagaaccggatgcbqcbbatbgagaacctbcgggagbcccagcccbaccgctacacg 

35 gbgaaggcgcgcaacggggccggcbgggggccbgagcgcgaggccabcabcaaccbggccacccagaccaagaggcccabg 
bccabccccatcabcccbgacabcccbafccgbggacgcccagagcggggaggacbacgacagGtbccbbabgbacagGgab 
gacgbbcbacgcbcbccabcgggcagccagaggcccagcgbcbccgabgacacbggcbgcggcbggaagbbcgagcccctg 
cbgggggaggagcbggaccbgcggcgcgbcacgbggcggcbgcccccggagcbcabcccgcgccbgbcggccagcagcggg 
cgcbccbccgacgccgaggcgccccacgggcccccggacqacggcggcgcgggcgggaagggcggcagccbgccccgcagb 

40 gcgacacccgggccccccggagagcaccbggtgaatggccggabggactbbgccbtcccgggcagcaccaacbcccbgcac 
aggabgaccacgaccagbgcbgcbgccbatggcacccaccbgagcccacacgbgccccaccgcgbgcbaagcacabccbcc 
acccbcacacgggacbacaacbcacbgacccgctcagaacacbcacactcgaccacacbgccgagggacbacbccacccbc 
accbccgbcbccbcccacggccbcccbcccabcbgggaacacgggaggagcaggcbbccgcbgbccbgggcccbggggbcc 
cggagbcgggcbcagabgaaagggbbccccccbbccaggcgcccacgagacbcbabaabccbggcbgggaggccagcagcg 

45 ccctcctggggcccagacbctcgcctgacfcgctggtgtgSccgacacgccoacccgcctggtgttcfccbgoccfcggggccc 
acabcbcbcagagfcgagcbggcaggagccgcggbgcgagcggccgcbgcagggcbacagbgbggagtaccagcbgcbgaac 
ggcggbgagcbgcabcggcbcaacabccccaacccbgcccagaccbcggbggtggbggaagaccbccbgcccaaccacbcc 
bacgbgbbccgagbgcgggcccagagccaggaaggcbggGgccgagagcgbgagggbgbcabcaccabbgaabcccaggbg 
cacccgcagagcccactgbgbccccbgccaggcfcccgccbbcacbbbgagcactcccagbgccccaggcccgcbggbgbbc 

50 acbgcccbgagaccagacbcgcbgcagcbgagcbgggagcggccacggaggcccaabggggababcgbeggcbaccbggbg 
acctgbgagafcggcccaaggaggaggbccagccaccgcabbccgggbggabggagacagccccgagagccggcbgaccgtg 
ccgggccbcagegagaacgbgcccbacaagbbcaaggtgcaggccaggaccacbgagggcbbcgggccagagcgcgagggc 
abcabcaGGatagagbcccaggabggaggbcccfctccGgcagcbgggcagccgbgccgggcbcbbccagcacccgcbgcaa 
agcgagbacagcagcabcbccaccacccacaccagcgccaccgagcccbbccbagbgggbccgacccbgggggcccagcac 

55 cbggaggcaggcggcbcccbcacccggcabgtgacccaggagbbbgbgagccggacacbgaccaccagcggaaccctbagc 
acccacabggaccaacagbbcbbccaaaGtbga 

6. SEQ ID NO:6 Human beta4 integrin protein sequence (acc# CAB61345):

1 magprpspwa rlllaalisv slsgfclnrck kapvkscbec vrvdkdcayc bdemfrdrrc 
61 nbqaellaag cqxesiwme ss£giteetq idfctlrrsqm spqglrvrlr pgeerhfela 

60 121 vfeplespvd lyilmdfsns msddldnlkk mgcnlarvls glbsdybigf gkfvdkvsvp 
181 qtdmrpeklk epwpnsdppf s£knvislbe dvcef-rnklq gerisgnlda peggfdailg 
241 bavebrdigw rpdsbhllvf sfcesafhyea dganvlagim srnderchld bbgbybqyrb 
3 01 qdypsvpblv rllakhniip iEavtnysys yyeklhtyfp vsslgvlqed ssnivellee 
3 61 afnrirsnld iraldsprgl rtevtskmfq ktrtgsfhir rgevgiyqvq Iralehvdgfc 

65 4 21 hvcqlpedqk gnihlkpsfs dglknndagii cdvcbcelqk evrsarcsfn gdfvcgqcvc 
481 segwsgqbcn csbgslsdiq pclregedkp csgrgecqcg hevcygegry egqfceydnf 
541 qcprbsgflc ndrgrcsmgq cvcepgwtgp scdcplsnat cidsnggicn grghcecgrc
601 hchqqslybd biceinysas brasartyap acsarrgapa rrrgaxvrna tsrsrwwtsl 
661 rearrwwcaa psgtrmttap tabpwkvtap lgptalswct rrrdcppgsf wwliplllll 
721 lpllalllll cwkycaccka clallpccnr ghmvgf kedh ymlrenlmas dhldtpmlrs 
5 781 gnlkgrdwr wkvtnnmqrp gfathaasin phelvpygls Irlarlcben llkpdtreca 
841 qlrqeveenl nevyrqisgv hklqqbkfrq qpnagkkqdh hivdtvlmap rsakpallkl 
901 bekqveqraf hdlkvapgyy tltadqdarg mvefqegvel vdvrvplfir pedddekqll 
961 veaidvpagt ablgrrlvni tiikeqardv vsfeqpefsv srgdqvarip virrvlccgk 
1021 sqvsyrtqdg baqgnrdyip vegellfqpg eawkelqvkl lelqevdsll rgrqvrrfhv 

10 1081 glsnpkfgah lgqphsttii irdpdeldrs ftsqmlssqp pphgdlgapq npnakaagsr 
1141 kihfnwlpps gkpmgyrvky wiqgdsesea hlldskvpsv elbnlypycd yemkvcayga 
12 01 qgegpysslv scrbhqevps epgrlafnw sshvtqlswa epaetngeifc ayevcyglvn 
12 61 ddnrpigpmk kvlvdnpknr rallienlres qpyrytvkar ngagwgpere aiinlatqpk 
1321 rpmsipiipd ipivdaqsge dydsflmysd dvlrspsgsq rpsvsddtgc gwkfepllge 

15 13 81 eldlrx^twr Ippeliprls assgrssdae aphgppddgg aggkggslpr satpgppgeh 
1441 lvngrmdfaf pgstnslhrm bbbsaaaygb hlsphvphrv Istsstltrd ynsltrsehs 
1501 hsttlprdys tltsvsshgl ppiwehgrsr Iplswalgsr sraqmkgfpp srgprdsiil 
1561 agrpaapswg pdsrltagvp dtptrlvfsa lgptslrvsw qeprcerplq gysveyqlln 
1621 ggelhrlnip npaqtsvwe dllpnhsyvf rvraqsqegw greregviti esqvhpqspl 

1681 cplpgsafbl stpsapgplv fbalspdslq ^swerprrpn gdivgylvtc emaqgggpat 
1741 afrvdgdspe srlbvpglse nvpykfkvqa rttegfgper egiitiesqd ggpfpqlgsr 
1801 aglfqhplqs eyssisttht satepflvgp tlgaqhleag gslbrlwbqe fvsrtlttsg 
1861 blsthmdqqf fqt 

7. SEQ ID NO:7 Mouse integrin beta4 subunit mRNA (not only cds) (acc# L04678): 

ggaccgtcgaggcagcgggac tgacccagctgggctcactgtattaagaagcggacccgcgacccggagcgcccggggacc 
cgatctgggagcctggacgggtgcagcgcgcaggaatgcagfcccgcctgactcaccagcgcctccttcctacctgcgccgc 
ccgbccabaaagcgcbgctcgbcccgcccgccgccgccgcccbgcbgbcccgccgggctcgcccgcgcgctcagctcgacc 
caacgcagcccaagtccgaggbagtctcactaaggaggaggaggatggcagggccctgttgcagcccatgggbgaagctgc 

tgcbgctggcacgaatgctgagtgccagcctccctggagacctggccaaccgcbgcaagaaggctcaggtgaagagctgta 
ccgagbgcabccgggbggacaagagctgtgcctactgcacagacgagctgttcaaggagaggcgctgcaacacccaggcgg 
acgbbcbggctgcaggcbgcaggggagagagcabccbggbcabggagagcagccbbgaaabcacagagaacacccagabcg 
bcaccagccbgcaccgcagccaggbatcbccccaaggccbgcaagtccggctgcggcggggtgaggagcgcacgttbgtgfc 
tccaggbctbbgagccccbggagagccccgbggatcbgtababcctcabggacbbcbccaacbccatgbcbgacgatctgg 

acaaccbcaagcagatggggcagaacctggccaagatccbgcgccagcbcaccagcgacbacaccattggatt tggaaagt 
btgbggacaaagtcagcgbcccacagacagacabgaggcccgagaaacbgaaggagccctggcccaacagtgatcccccgt 
bcbccbbcaagaacgt bat cage ttaacggagaatgtggaagaatbcbggaacaaactgcaaggagaacgcatctcaggca 
accbggacgctcctgaagggggctbtgatgccabcctgcagacagctgbgtgcacaagggacabbggctggagggcbgaca 
gcacccacctgctggbgttctccaccgagbctgccbbccacbacgaggctgatggbgccaacgbtctggccggcatcabga 

accgcaatgatgagaaabgccaccbggacgcctcgggcgcctacacccaabacaagacacaggacbacccabcagtgccca 
cgctggbbcgccfcgctbgccaagcataacabcatccccabctbbgcbgbcaccaacbacbcbbacagctacbabgagaagc 
tccabaagbabttccccgbctccbctctgggcgbccbgcaggaggabtcatccaacatcgbggagctgctggaggaggcct 
bctatcgaabbcgcbccaacctggacatccgggctctggacagccccagaggcctgagaacagaggtcacctccgatacbc 
bccagaagacggagacbgggtcctt^cacatcaagcggggggaagtgggcacatacaatgtgcatctccgggcagtggagg 

acatagabgggacacabgbgtgccagctggctaaagaagaccaagggggcaacatccacctgaaaccctccttctctgatg 
gccbccggabggacgcgagtgbgatctgbgacgtgbgcccctgbgagctgcaaaaggaagtbcgabcagctcgctgtcacb 
bcagaggagacbtcabgbgbggacactgtgbgbgcaatgagggctggagbggcaaaacctgcaacbgcfcccaccggcbcbc 
bgagbgacacacagcccbgcctgcgtgagggbgaggacaaaccgtgcbcgggccacggcgagbgccagtgcggacgctgbg 
bgbgcbabggtgaaggccgctacgagggbcacbtcbgcgagbabgacaacbtccagtgtccccggacctctggatfccctgt 

gcaatgaccggggacgctgttctatgggagagtgtgbgtgbgagcctggttggacaggccgcagctgcgactgtcccctca 
gcaabgccaccfcgcabcgabagcaacgggggcatctgcaacggccgaggctactgtgagbgbggccgttgtcactgcaacc 
agcagbcgctcbacacggacaccaccbgtgagabcaacbactcbgcgabactgggtctcfcgbgaggabctccgctccfcgcg 
bacagtgccaggcctggggcaccggggagaagaaagggcgcgcgtgbgacgattgccccbttaaagbcaagabggbagacg 
agcbtaagaaagaagaggtggbggagbactgcbccbtccgggatgaggatgacgactgcacbbacagcbacaacgbggagg 

gcgacggcagcccbgggcccaacagcacagtcctggbccacaaaaagaaagactgccbcccggctccbtcctggtggcbca 
tcccccbgctcabctbcctccbgbbgctccbggcgbbgcbbcbgctgctctgctggaaabactgbgcctgctgcaaagcct 
gcctggggc t bcbtccttgcbgcaauegaggfecacatggtgggccttaaggaagatcactatatgcttcgggagaacctga 
bggcc be tgaccacctggacacgcccatgctacgaagcgcgaacctcaagggacgagacacagtccgctggaagat caeca 
acaatgbgcagcgccctggcbbtgccacccatgccgccaccaccagccccacggagcbcgtaccctacgggcbgbcccbgc 

gccbfcggccgcctctgcacbgagaaccttatgaagccggccacccgagagtgtgaccagctacgccaggaggtggaggaaa 
abcbgaabgaggtgtabagacaggbcagcggcgcacacaagcbccagcagacgaagbbccgacagcagcccaacgccggga 
aaaagcaagaccacaccatbgtggacacagbgttgctggcgccccgctccgccaagcagatgcbgcrgaagctgacagaga 
agcaggtggagcaggggtcctfeccabgaacbgaaggtggcccctggctactacactgtcacggcagagcaggatgcccggg 
gcabggbggag I Lccaggagggcg bggagctggtggatgtgcgagtgcccctcttcatccggcc tgaggatgatgatgaga 

agcagctgcbggtggaggccatbgabgbccctgtgagcacbgccaccct tggtcgccgtctggbaaacatcaccatbatca 
aggaacaagctagtggggtagtgtccbtcgagcagcctgaatactcggbgagfccgtggagaccaggzggcccgcatccctg 
fccatccggcacahcctggacaatggcaagfccccaggtctcctatagcecaca.ggataa.tacagcacacggacaccgggatt 
abgtfccccgbggagggagagcfcgcbgbbccabccbggggaaaccbggaaggagtbgcaggbgaagctactggagctgcacg 
aggtfcgactcccfccctgcgtggccgccaggtccgccgcttccaagtccaactcagcaaccccaagttcggagcccgcctcg 
gccagcccagcacaaccaccgttattctcgatgaaacggacaggagtctcataaatcaaacactttcatcgcctccgccac 
cccatggagacctgggcgccccacagaaccccaatgccaaggctgccggatccaggaagatccatttcaactggctgcccc 
ctccbggcaagccaatggggtacagggtgaagfcactggatccagggcgactctgaatctgaagcccaccttctagatagca 
a 9gtgcccbcagtggaactcaccaacctgbatccctattgcgactacgaaatgaaggtgtgtgcctatggggccaagggtg 
a 999g cc ctatagctcactggtgtcctgccgcacccaccaggaagtacccagtgagccagggaggctggctttcaatgtag 
tctcbbctacggtgactcagctgagctgggcagagccagctgagaccaatggcgagatcacagcctacgaggtctgctatg 
gactggbcaatgaggacaacagacccabtggacctatgaagaaggtgctcgtggacaaccccaagaaccggafcgctgctca 
btgagaatcbgcgagattcccagccataccgatacacggbbaaggcgcgcaatggggcaggatggggacccgagagagagg 
ctatcatcaaccfcggcfcacacagcccaagcggcccatgtccatccctatcatcccagacatccccatagfcggacgcccagg 
gtgg^gasgacbacgaaaacbbccttatgtacagbgabgacgbccbgcggtccccagccagcagccagaggcccagcgttt 
ctgatgacactgagcaccbggfcgaabggccggabggacbbfcgcctatccaggcagcgccaactccctgcacagaatgactg 

cagccaabgbggccfcafcggcacgcabctgagcccacacctgtcccaccgagbgctgagcacgbcctccacccttacccggg 
acfcaccacbcbcbgacacgcacagagcacbcccacbcaggcacacbbcccagggactactccaccctcacttccctttcct 
cccaagccbcccbccbatcbgggaagabgggaggagcaggcttccgctgtcctggactcttgggtccttgagccgggctca 
cabgaagggbgtgcccgcabccaggggbbcaccagactctataabcctggccgggcagtcagcagcacccbccbggggbac 
aggabbcccgbggggctgtgggtgtgccbgacacacccactcggctggbgbbcbcbgccctggggcgcacgbcbbbgaagg 

bgagcbggcaggagccacagbgtgabcggacgcbgcbgggcbacagbgbggaataccagctacbaacgbgcgtcgagatgc 
atcggcbcaacabcccbaacccbggccaaaccbcggbggbggbagaggabcbcctgccbaactacbcbcabgtgbbccggg 
tacgggcacagagccaggagggctggggtcgagagcgagagggbgtcatcaccabcgagbcccaggtgcacccgcagagcc 
cbcbcbgcccccbgccaggcbcagccbbcacbctgagcacccccagcgccccaggaccactggtgttcactgccctaagcc 
cagacbccccgcagctcagcbgggagcggccgaggagccrgcaabggagatabcctbggctacctggtgacctgtgagatgg 

cccaaggaggagcaccagccaggacctbccgggbggacggagacaacccbgagagccggttgacbgbaccbggccbcagtg 
agaacgfcbccbbacaagbbcaaggtbcaggccaggacgaccgagggcbtbgggccagagcgbgagggbatcabcaccatcg 
agbctcaggtbggaggccccbbcccacagctgggcagcaabbcbgggcbcbbccagaacccagbgcaaagcgagttcagca 
g c 9tgaccagcacgcacagcaccacgacbgagcccttccbcabggatggtcbaaccctggggacccagcgccfcggaagcag 
gaggctcccbcacccggcatgbgacccaggaatbcgtgacccggacctbaacggccagtggctcbcbcagcactcatatgg 

accaacagtbcbbccaaacctgaacctcccccgcgccccagccacctgggcccctccttgcctcctctcctagcgccttct 
tcctcbgcbgcbcbacccacgagcbtgctgaccacagagccagccccbgtagtcagagagcagggctaggtgctgtccagg 
aaccataaagbgggbagaggbgatacaaggtcbbbctgactgcabcccaeccbgggbccaabcccacabgtaacc 

8. SEQ ID NO:8 Murine integrin beta4 protein sequence: 

MAGPCCSPWV KLIiL'L'ARMLS AS LPGDLANR CKKAQVKSCT ECIRVDKSCA YCTDELFKERRCNTQADVLA 
AGCRGESII/V MESSLEITEN TQIVTSLHRS QVSPQGLQVR LRRGEERTFVFQVFEPLESP VDLYILMDFS 
MSMSDDLDNL KQMGQNLAKI LRQLTSDYTI GFGKFVDKVSVPQTDMRPEK LKEPWPNSDP PFSFKNVISL 
TENVEEFWNK LQGERISGNL DAPEGGFDAILQTAVCTRDI GWRADSTHLIj VFSTESAFHY EADGANVLAG 
IMNRNDEKCH LDASGAYTQY 

KTQDYPSVPT LVRLLAKHNI IPIFAVTNYS YSYYEKLHKY FPVSSL.GVLQ EDSSNIVELLEEAFYRIRSN 
LDIRALDSPR GLRTEVTSDT LQKTETGSFH I KRGEVGT YN VHXjRAVEDIDGTHVCQLAKE DQGGNIHLKP 
SFSDGLRMDA SV1CDVCPCE LQKEVRSARC HFRGDFMCGHCVCNEGWSGK TCNCSTGSIjS DTQPCLREGE 
DKPCSGHGEC QCGRCVCYGE GRYEGHFCEYDNFQCPRTSG FliCNDRGRCS MGECVCEPGW TGRSCDCPLS 
MATCIDSNGG ICNGRGYCEC 

GRCHCNQQSL YTDTTCEINY S AI LGLCEDL RSCVQCQAWG TGEKKGRACD DCPFKVKMVDELKKEEWEY 
CSFRDEDDDC TYSYNVEGDG SPGPNSTVLV HKKKDCLiPAP SWWLIPLLI FLLLbLALBLIi LCWKYCACCK 
ACLGkLPCCN RGHMVGFKED HYMLRENLMA SDHLDTPMLRSGNLKGRDTV RWKITNNVQR PG FAT HAAS T 
SPTELVPYGL SltRLGRLCTE NLMKPGTRECDQLRQEVEEN LNEVYRQVSG AHKLQQTKFR QQPNAGKKQD 
HTIVDTVIjLA PRSAKQMDLK 

JjTEKQVEQGS FHELKVAPGY YTVTAEQDAR GMVEFQEGVE LVDVRVPIiFI RP EDDDE KQLLVEAI DVP VS 
TATLGRRIiVM ITUKEQASG WSFEQPEYS VSRGDQVARI PVIRHILDNGKSQVSYSTQD NTAHGHRDYV 
PVEGELDFHP GETWKELQVK JjXjELQEVDSL IiRGRQ VRR FQVQ h S NP KFGA RLGQPSTTTV ILDETDRSLI 
NQTLSSPPPP HGDLGAPQNP NAKAAGSRKIHFNWLPPPGK PMGYRVKYWI QGDSESEAHL loDSKVPSVEL 
TNLYPYCDYE KKVCAYGAKG 

EGPYSSLVSC RTHQEVPSEP GRIAFNWSS TVTQLSWAEP AETNGEITAY EVCYGLVNEDNRPIGPMKKV 
LVDNPKNRMEj LIENLRDSQP YRYTVKARNG AGWGPEREAI INIiATQPKRPMSIPIIPDIP IVDAQGGEDY 
ENFLMYSDDV LRSPASSQRP SVSDDTEHLV NGRMDFAYPGSAMS3UHRMTA ANVA YGTH h S PHLSHRVLST 
SSTTfTRDYHS LTRTEHSHSG TLPRDYSTLTSLSSQASLLS GKMGGAGFRC PGL'LGP 

9. SEQ ID NO:9 Mouse beta4 dominant negative cds: 

afcggcagggccctgbbgcagcccatgggbgaagcbgcbgctgctggcacgaabgctgagbgccagcctcccfcggagacctg 
gccaaccgcbgcaagaaggcbcaggtgaagagctgtaccgagtgcafcccgggbggacaagagctgtgccbacfcgcacagac 
gagctgbbcaaggagaggcgcbgcaacacccaggcggacgttcbggcbgcaggctgcaggggagagagcatccbggtcatg 
gagagcagcctbgaaabcacagagaacacccagatcgbcaccagccbgcaccgcagccaggbatctccccaaggcctgcaa 
gfcccggcbgcggcggggtgaggagcgcacgbbbgtgbbccaggbcttbgagcccctggagagccccgtggatctgtatatc 
ctcabggacbbcbccaacbccabgbcbgacgatcfcggacaacctcaagcagatggggcagaacctggccaagatcctgcgc 
cagctcaccagcgacbacaccabbggattfcggaaagbttgbggacaaagtcagcgtcccacagacagacafcgaggcccgag 
aaacfcgaaggagcccfcggcccaacagfcgatcccccgttctccttcaagaacgfctatcagcfctaacggagaatgtggaagaa 
bbcbggaacaaacbgcaaggagaacgcabcbcaggcaaccbggacgcbccbgaagggggcbb tgatgccatcc tgcagaca 
gctgfcgfcgcacaagggacafctggcfcggagggctgacagcacccaccfcgcfcggbQfcbctccaccgagtcfcgccttccactac 
gaggctgatggtgccaacgttctggccggcatcahgaaccgcaatgatgagaaafcgccacctggacgcctcgggcgcctac 
acccaatacaagacacaggactacccatcagtgcccacgc bggbbcgccbgcbbgccaagcataacabcabccccabcb tt 
gctgtcaccaactactcttacagctactatgagaagctccataagtatttccccgtctcctctctgggcgtcctgcaggag 
gafctcatccaacatcgtggagctgctggaggaggccttctatcgaafctcgcfcccaacctggacatccgggctctggacagc 
cccagaggccfcgagaacagaggtcacctccgafcacfcctccagaagacggagactgggtcctt tcacatcaagcggggggaa 
gtgggcacafcacaatgtgcatctccgggcagtggaggacafcagatgggacacatgtgtgccagctggcbaaagaagaccaa 

gggggcaacabccaccbgaaacccfcccbbcbcbgabggccbccggabggacgcgagbgbgabcbgtgacgbgbgccccbgb 
gagcbgcaaaaggaagbbcgabcagcbcgcbgbcacbbcagaggagacbbcabgbgbggacacbgfcgbgbgcaabgagggc 
bggagtggcaaaaccbgcaacbgcbccaccggctcbcfcgagtgacacacagcccbgccbgcgbgagggbgaggacaaaccg 
tgctcgggccacggcgagbgccagbgcggacgctgbgtgtgctabggfcgaaggccgcfcacgagggtcactbcbgcgagbab 
gacaacbbccagbgbccccggaccbcbggabbccbgbgcaabgaccggggacgctgbtctabgggagagbgbgtgbgtgag 

ccbggfcbggacaQgccgcagcbgcgacbgbccccbcagcaabgccacGbgcabcgabagcaacgggggcabcbgcaacggc 
cgaggcbacbgbcagbgbggccgbbgbcacbgcaaccagcagbcgcbcbacacggacaccaccbgbgagabcaacbacbcb 
gcgabacbgggbcbcbgbgaggabcbccgcbccbgcgbacagbgccaggccbggggcaccggggagaagaaagggcgcgcg 
bgbgacgabbgccccbbbaaagbcaagabggbagacgagcbbaagaaagaagaggbggbggagbacbgcbccbtccgggab 
gaggatgacgacbgcacbbacagcbacaacgbggagggcgacggcagcccbgggcccaacagcacagbccbggtccacaaa 

aagaaagacbgccbcccggctccbbccbggbggcbcabcccccbgcbcabctbcctccbgbbgcbccbggcgbtgcbtcbg 
cbgcbcbgcbggaaabga 

10. SEQ ID NO:10 Mouse beta4 dominant negative protein sequence: 

MAGPCCSPWV KLLLLARMLS ASIiPGDLANR CKKAQVKSCT ECIRVDKSCA YGTDEIjFKERRCNTQADVDA 
AGCRGESILV KESSLEXTEN TQIVTSXiHRS QVSPQGLQVR LRRGEERTFVFQVFEPLiES P VDXjYIXjMDFS 
NSMSDDLDNL KQMGQNLAKI IjRQLTSDYTI GFGKFVDKVSVPQTDMRPEK lkepwpnsdp pfsfknvisl 
TENVEEFWNK XjQGERISGNIj DAPEGGFDA1LQTAVCTRDI GWRADSTHLL VFSTESAFHY EADGANVLAG 
XMNRNDEKCH LDASGAYTQY 

KTQDYPSVPT XjVRI>LiAKHNX IPIFAVTNYS YSYYEKLHKY FPVSSLGVLQ EDSSNIVEIjLjEEAFYRIRSN 
LDIRALDSPR GLRTEVTSDT LQKTETGSFH IKRGEVGTYM VHDRAVED I DGTHVCQLAKE DQGGNIHLKP 
SFSDGXjRMDA svicdvcpce LQKEVRSARC HFRGDFMCGHCVGNEGWSGK TCNCSTGSXjS dtqpclrege 
DKPCSGHGEC QCGRCVCYGE GRYEGHFCEYDNFQCPRTSG FliCMDRGRCS MGECVCEPGW TGRSCDCPLS 
NATCIDSNGG XCNGRGYCEC 

GRCHCHQQSIj YTOTTCEINY SAIIjGIjCEDIj RSCVQCQAWG tgekkgracd DCPFKVKMVDELKKEEWEY 
CSFRDEDDDC TYSYKFVEGDG SPGPNSTVLV HKKKDCLPAP S WWLiIPLLI FLLLLLiALLiXjLiIiCWK 

11. SEQ ID NO:11 Murine laminin gamma2 chain complete cds (acc# U43327): 

atgcctgcgctctggctcagctgctgcGtcggtgtcgcgcbcctgctgcccgccagccaggccacctccagga 
gggaagtcbgbgabtgcaatgggaagbccaggcaabgbgbctbbgabcaggagcbccabcgacaagcaggcag 
cgggbbccgbbgcctcaacbgcaatgacaabacagcgggggbbcacbgcgagcggbcgagggaggggtbtbac 

cagcatcagagcaagagccgctgcctacccbgcaacbgccacbcaaagggbbccctcagbgcbggatgbgaca 
actcbggacaatgcaggtgtaagccaggtgtgacaggacaaagabgtgaccagtgtcagccaggcttccabat 
gcbcaccgabgcbggatgcacccgagaccaggggcaactagattccaagtgtgactgtgacccagctggcatc 
tctggaccctgtgabbctggccgabgtgbcbgcaaaccagccgbcactggagagcgctgbgataggtgccgac 
cacgtgactatcatcbggaccgggcaaacccfcgagggctgtacccagtgtbtctgcbabgggcabtcagccag 

ctgccacgccbcbgccgacbbcagbgbccacaaaabcacbbcaacbttcagbcaggatgbggatggtbggaag 
gcggbbcagagaaacggggcaccbgcaaaactccacbggtcacagcgccatcgggacgbgbttagbtctgccc 
gaagatcagaccccgbctabttcgtggcccctgccaaattcctcggtaaccagcaagtgagbtacgggcagag 
cctgtcbttfcgactaccgcgtggacagaggaggtagacagccgtctgccfcacgatgtgabcctggaaggtgct 
ggbcbacagabcagagcbccbcbgabggcbccaggcaagacacbbccbtgbgggabcacaaagacttacacat 

tcagacbgaabgaacabccaagcagbcacbggagbccccagctgagbbabtbcgaatabcgaaggbbactgcg 
gaaccfccacagccctccbgabgabccgagcbacgtacggagaabatagtacagggtacabbgabaacgbgacc 
cbggttbcagcccgcccbgtccbbggagccccagccccbbgggtfcgaacgbbgtgtabgcctgcbggggbaca 
agggacaatbcbgccaggaabgbgcbbcbggbbacaaaagagabbcggcaagabbgggcgcbbbtggcgcctg 
bgttcccbgbaacbgccaaggggagggggccbgbgatccaqacacgggagabbgcbacbcgggggacgagaab 

cctgacattgagtgtgctgactgtcccatcggtttctacaatgacccacatgacccccgcagcbgcaagccab 
gbcccbgfccacaatgggbbcagcbgbbcagbgabgccfcgagacagaggaggbggfcgbgtaacaactgtccccc 
tggggbcacaggtgcccgctgtgagcbcbgtgcbgabggcttcbbbggggabcccbbbggggaacabggccca 
gtgaggccbbgbcaacgcbgccaabgcaacaacaacgbggaccccaabgccbctgggaactgbgaccagbbga 
caggcagatgc ttgaaabgbabcbacaacacggccggtgtcbacbgbgaccagbgcaaagcaggbbacbttgg 

agacccabtggctcccaacccagcagacaagtgtcgagct tgcaacbgcagccccabgggbgcggagccbgga 
gagbgbcgaggtgabggcagcbgcgbt bgcaagccaggcbbbggcgcc btcaactgbgatcacgcagccctaa 
ccagbbgbcctgcttgcbacaabcaagbgaagabbcagabggaccagbtfcacccagcagcbccagagccbgga 
ggcccbggtbbcaaaggctcagggbggbggbggbggtggbacagbcccagbgcagcbggaaggcaggabcgag 
caggcbgagcaggcccbbcaggacabbcbgggagaagcbcagatttcagaaggggcaabgagagccgbtigctg 
tccggctggccaaggcaaggagccaagagaacgactacaagacccgcctggatgacctcaagatgactgcaga 
aaggabccgggcccbgggcagbcagcatcagaacagagbbcaggabacgagcagacbcabctcbcagabgcgc 
cbgagbcbggcaggaagcgaagcbcbctbggaaaacactaababccabtcbtctgagcacbacgbggggccga 
abgabbbbaaaagbcbggcbcaggaggcbacaagaaaggcagacagccacgctgagbcagcbaacgcaabgaa 
gcaactagcaagggaaactgaggactactccaaacaagcactttcattggcccgcaagchcttgagtggagga 
ggcggaagbggcbcbbgggacagctccgbggbacaaggbcbbabgggaaaatbagagaaaaccaagbccctga 
gccagcagctgbcabtggagggcacccaagccgacabbgaagcbgabaggbcgtabcagcacagbctccgcct 

cctggattctgc etc t cage ttcagggagtcagtgatctgtcctttcaggtggaagcaaagaggatcagacaa 
aaggctgattctcbctcaaacctggtgaccagacaaacggatgcattcacgcgtgtgcgaaacaatcfcgggga 
acbgggaaaaagaaacacggcagcbtbbacagacbggaaaggataggagacagacbtcagatcagcbgctbbc 
ccgbgccaaccbbgcbaaaaacagagcccaagaagcgcbaagbabgggcaabgccacbbbfcbabgaagbbgag 
aacatccbgaagaaccbccgagagbbbgabcbgcaggbbgaagacagaaaagcagaggcbgaagaggccabga 

agagactcfccctctat tagecagaaggt tgcggatgccagtgacaagacccagcaagcagaaacggccctggg 
gagcgccacbgccgacacccaacgggcaaagaacgcagcbagggaggcccbggagabcagcagcgagabagag 
cbggagabagggagbcbgaacbbggaagcbaabgbgacagcagabggggcc bbggccabggagaaagggacbg 
ccactctgaagagcgagatgagagagatgattgagctggccagaaaggagctggagt ttgacacggataagga 
cacggbgcagctggbgabbacbgaagcccagcaagcbgabgccagagccacgagtgccggagbbaccabccaa 

gacacrctcaacacat tggacggcatcctacacctcatagaccagcctggcagtgtggabgaagaagggatga 
tgetatbagaacaaggge tbttccaagccaagacccagatcaacagtcgac ttcggcccttgatgbctgaccb 
ggaggagagggtgcgbcggcagaggaaccacctccatctgctggagacbagcatagabggaabbcbbgctgat 
gtgaagaacctggagaacatbcgagacaacctgcccccaggctgctacaatacccaagcbctbgagcaacagt 
ga 

12. SEQ ID NO:12 Murine laminin gamma2 chain protein sequence (acc# AAA85256): 

1 mpalwlsccl gvalllpasq atsrrevcdc ngksrqcvtd qelhrqagsg frclncndnt 
61 agvhcersre gfyqhqsksr clpcnchskg slsagcdnsg qcrckpgvtg qrcdqcqpgf 
121 hmltdagctr dqgqldskcd cdpagisgpc dsgrcvckpa vtgercdrcr prdyhldran 

181 pegebqefey ghsaschasa dfsvhkitst fsqdvdgwka vqrngapakl hwsqrhrdvf 
241 ssarrsdpvy fvapakflgn qqvsygqsls fdyrvdrggr qpsaydvile gaglqirapl 
301 mapgktlpcg ibktybfrln ehpsshwspq Isyfeyrrll rnlballmir abygeysbgy 
361 idnvblvear pvlgapapwv ercvcllgyk gqfcqecasg ykrdsarlga fgacvpcncq 
421 gegacdpotg dcysgdenpd iecadcpigf yndphdprsc kpcpchngfs csvmpeteev 

481 venneppevt garcelcadg ffgdpfgehg pvrpcqrcqc nnnvdpnasg ncdqltgrcl 
541 kciynbacvy cdqckagyfg dplapnpadk craencspmg aepgecrgdg scvckpgfga 
601 fncdhaalbs cpacynqvki qmdqfbqqlq slealvskaq ggggggbvpv qlegrieqae 
661 qalqdilgea qisegamrav avrlakarsq endykbrldd lkmtaerira Igsqhqnrvq 
721 dtsrlisqrnr lslagseall enbnihsseh yvgpndfksl aqeabrkads haesanamkq 

781 laretedysfc qalslarkll sggggsgswd sswqglmgk lektkslsqq lslegtqadi 
841 eadrsyqhsl rlldsasqlq gvsdlsfqve akrirqkads lsnlvtrqbd afbrvrr.nlg 
901 nwekebrqll qbgkdrrqbs dqllsranla knraqealsm gnabfyeven ilknlrefdl 
961 qvedrkaeae eamkrlssis qkvadasdkb qqaatalgsa badbqrakna arealeisse 
1021 ieleigslnl eanvtadgal amekgbablk semremiela rkelefdbdk dbvglvibea 

1081 qqadarabsa gvbiqdblnb Idgilhlidq pgsvdeegmm lleqglfqak bqinsrlrpl 
1141 msdleervrr qrnhlhlleb sidgiladvk nlenirdnlp pgcynbqale qq 

13. SEQ ID NO:13 Human laminin gamma2 chain complete cds (acc# AH006634): 

abgccfcgcgcbctggcbgggcbgcbgccbctgcbbcbcgctccbccbgcccgcagcccgggccaccbccaggagggaagbc 
bgbgabbgcaabgggaagbccaggcagtgbabcbtbgat cgggaacbbcacagacaaacbggfcaabggabbccgcbgcctc 
aacbgcaabgacaacacbgatggcabtcacbgcgagaagtgcaagaafcggcbb t baccggcacagagaaagggaccgcbgb 
tbgcccbgcaabtgbaacbccaaaggtbctcbbagtgcbcgatgbgacaacbcbggacggbgcagctgtaaaccaggtgbg 
acaggagccaca t-gcgaccgatgfcctgccaggcttc!cacabgct".cacgga tgcggggbgcacccaagaccagagactgcta 
gactccaagtgtgactgtgacccagctggcatcgcagggcccbgtgacgcgggccgctgtgtctgcaagccagctgttact 
ggagaacgctgfcgataggbgtcgabcaggbfcacbataabctggabggggggaacccbgagggcbgbacccagbgbbbcbgc 
fcabgggcafcbcagccagcbgccgcagctcbgcagaabacagtgbccabaagafccaccbcbaccbbbcabcaagabgbtgab 
ggcbggaaggctgfcccaacgaaabgggbcbccbgcaaagctccaabggtcacagcgccabcaagabgbgbbbagctcagcc 
caacgactagacccbgbcbatbbtgbggcbccbgccaaatttcbbgggaatcaacaggbgagcbabgggcaaagccbgbce 
tbbgactaccctgbggacagaggaggcagacacccabczgcccabgatgtgabtcbggaaggbgcbggtcbacggatcaca 
gctcccbhgatgccacbbggcaagacactgccttgtgggcfccaccaagacttaca cattcaggttaaatgagcatccaagc 
aataattggagcccccagcfcgagtfcactttgagtatcgaaggbtactgcggaatcfccacagcccbccgcabccgagctaca 
tafcggagaatacagtactgggtacafcfcgacaafcgtgaccctgafcttcagcccgccctgfcctcbggagccccagcacccfcgg 
gbbgaacagbgbababgbccbgbtgggbacaaggggcaat bcbgccaggabbgbgcbbcbggcbacaagagagabbcagcg 
agacbggggccbbbbggcaccbgbabbccbbgbaacbgbcaagggggaggggccbgbgabccagacacaggagabbgtbab 
bcaggggatgagaabccbgacabbgagbgbgcbgacbgcccaabbggbbbcbacaacgabccgcacgacccccgcagcbgc 
aagccabgbcccbgbcabaacgggbbcagcbgcbcagbgabgccggagacggaggaggbggbgbgcaabaacbgcccbccc 
ggggbcaccggbgcccgcbgbgagcbchgbgcbgabggctacbbbggggaccccbbbggbgaacabggcccagbgaggccb 
tgbcagcccbgbcaabgcaacaacaabgbggaccccagbgcctcbgggaabbgbgaccggctgacaggcaggbgbbbgaag 
tgtabccacaacacagccggcabchacbgcgaccagbgcaaagcaggcbacbbcggggacccabtggcbcccaacccagca 
gacaagbgbcgagcbbgcaacbgtaaccccabgggcbcagagccbgbaggabgbcgaagbgabggcaccbgbgbbbgcaag 
ccaggabbbggbggccccaacbgtgagcabggagcab b cage bgbccagcbbgc bat aabcaagbgaagabbcagabggab 

cagbbbabgcagcagcbbcagagaabggaggcccbgabbbcaaaggcbcagggbggbgabggagbagbacctgabacagag 
cbggaaggcacgabgcagcaggcbgagcaggcccbbcaggacabbcbgagagabgcccagabbbcagaaggbgcbagcaga 
bcccbbgghcbccagbbggccaaggbgaggagccaagagaacagcbaccagacccgccbggabgaccbcaagabgactgbg 
gaaagagbbcgcgcbcbgggaagbcagbaccagaaccgagbbcgggabacbcacaggcbcatcactcagabgcagcbgagc 
cbggcagaaacbgaagcbbccbtgggaaacacbaacabtccbgccbcagaccacbacghggggccaaabggcbbtaaaagb 

cbggcbcaggaggccacaagabbagcagaaagccacgbbgagbcagccagbaacabggagcaacbgacaagggaaac tgag 
gacbabbccaaacaagcccbcbcacbggbgcgcaaggccebgcabgaaggagbcggaagcggaagcggbagcccggacggb 
gcbgbggbgcaagggcbbgbggaaaaabbggagaaaaccaagbcccbggcccagcagbbgacaagggaggccacbcaagcg 
gaaabbgaagcagabaggbcbbatcagcacagbcbccgccbccbggabbcagbgbctcggcb bcagggagbcagbgatcag 
bccbbbcaggbcgaagaagcaaagaggabcaaacaaaaagcggabbcacbcbcaacgcbggtaaccaggcababggabgag 

bbcaagcgtacacaaaagaabcbgggaaacbggaaagaagaagcacagcagcbcbbacagaabggaaaaagbgggagagag 
aaahcagahcaccbgcbbbcccgbgccaabcbbgcbaaaagcagagcacaagaagcacbgagbahgggcaabgccact bbb 
babgaagbbgagagcatccbbaaaaaccbcagagagbbbgaccbgcaggbggacaacagaaaagcagaagcbgaagaagcc 
abgaagagacbcbccbacabcagccagaaggbt bcagatgccagbgacaagacccagcaagcagaaagagGccbggggagG 
gcbgcbgcbgabgcacagagggcaaagaabggggccggggaggcccbggaaabcbccagbgagabbgaacaggagabbggg 

agbcbgaacbbggaagccaabgbgacagcagabggagccbbggccabggaaaagggacbggccbcbcbgaagagbgagabg 
agggaagbggaaggagagcbggaaaggaaggagctggagb t bgacacgaatabggabgcagtacagabggbgabbacagaa 
gcccagaaggbbgabaccagagccaagaacgcbggggbbacaatccaagacacacbcaacacattagacggccbccbgcab 
cbgabggaccacccbcbcagbgbagabgaagaggggcbggbctbacbggagcagaagcbbbcccgagccaagacccagabc 
aacagccaacbccggcccabgabgtcagagcbggaagagagggcacgbcagcagaggggccaccbccabbbgcbggagaca 

agcabagabggcabbcbggcbgabgtgaagaacbbggagaacabbagggacaaccbgcccccaggcbgcbacaabacccag 
gcbcbbgagcaacagbga 

14. SEQ ID NO:14 Human laminin gamma2 chain protein sequence (acc# AAC50457) Alternative splice form 

1 mpalwlgccl cfslllpaar atsrrevcdc ngksrqcifd relhrqbgng frclncndnb 
61 dgihcekckn gfyrhrerdr clpcncnskg slsarcdnsg rcsckpgvbg arcdrclpgf 
121 hmlbdagcbq dqrlldskcd cdpagiagpc dagrcvekpa vbgercdrcr sgyynldggn 
181 pegebqefey ghsascrssa eysvhkibsb fhqdvdgwka vqrngspakl qwsqrhqdvf 
241 ssaqrldpvy fvapakflgn qqvsygqsls fdyrvdrggr hpsahdvile gaglribapl 

301 mplgkblpcg Itkbybfrln ehpsnnwspq Isyfeyrrll rnlbalrira bygeysbgyi 
361 dnvblisarp vsgapapwve qcicpvgykg qfcqdcasgy krdsarlgpf gbcipcncqg 

421 ggacdpdbgd cysgdenpdi ecadepigfy ndphdprsck pepchngf sc svmpebeew 

481 cnncppgvbg arcelcadgy fgdpfgehgp vrpcqpcqcn nnvdpsasgn cdrlbgrclk 
541 cihnbagiyc dqckagyfgd plapnpadkc racncnpmgs epvgcrsdgb cvckpgfggp 
601 ncehgafscp acynqvkiqm dqfmqqlqrm ealiskaqgg dgwpdbele grmqqaeqal 

661 qdilrdaqis egasrslglq lakvrsqens yqsrlddlkm tvervralgs qyqnrvrdbh 
721 rlibqmqlsl aeseaslgnt nipasdhyvg pngfkslaqe abrlaeshve sasnmeqlbr 
781 ebedyskqal slvrkalheg vgsgsgspdg ayvqglvekl ekbkslaqql breabqaeie 
841 adrsyqbslr lldsvsrlqg vsdqsfqvee akrikqkads Isblvbrhmd efkrbqknlg 
901 nwkeeaqqll qngksgreks dqllsranla ksraqealsm gnabfyeves ilknlrefdl 

961 qvdnrkaeae eamkrlsyis qkvsdasdkb qqaeralgsa aadaqrakng agealeisse 
1021 ieqeigslnl eanvbadgal amekglaslk semrevegel erkelefdbn mdavqmvibe 
1081 aqkvdbrakn agvbiqdbln bldgllhlmd gplsvdeegl vlleqklsra kbqinsqlrp 
1141 mmseleerar qqrghlhlle bsidgiladv knlenirdnl ppgcynbqal eqq 

15. SEQ ID NO:15 Human laminin gamma2 chain protein sequence (acc# AAC50456) Alternative splice form 

1 mpalwlgccl cfslllpaar atsrrevcdc ngksrqcifd relhrgbgng frclncndnb 
61 dgihcekckn gfyrhrerdr clpcncnskg slsarcdnsg rcsckpgvbg arcdrclpgf 
121 hrnlbdagcbq dqrlldskcd cdpagiagpc dagrcvekpa vbgercdrcr sgyynldggn 
181 pegebqefey ghsascrssa eysvhkitst fhqdvdgwka vqrngspakl qvjsqrhqdvf 
241 ssaqrldpvy fvapakflgn qqvsygqsls fdyrvdrggr hpsahdvile gaglribapl 

301 mplgkblpcg Ibkbybfrln ehpsnnwspq Isyfeyrrll rnlbalrira bygeysbgyi 
361 dnvblisarp vsgapapwve qcicpvgykg qfcqdcasgy krdsarlgpf gbcipcncqg 
421 ggacdpdbgd cysgdenpdi ecadepigfy ndphdprsck pcpchngfgc svmpebeew 

481 cnncppgvbg arcelcadgy fgdpfgehgp vrpcqpcqcn nnvdpsasgn cdrlbgrclk 
541 cihnbagiyc dqckagyfgd plapnpadkc racncnpmgs epvgcrsdgt cvckpgbggp 
601 ncehgafscp acynqvkiqm dqfmqglqrm ealiskaqgg dgwpdfcele grmqqaeqal 
661 qdilrdaqis egasrslglq lakvrsgens yqsrlddlkm tvervralgs gygnr-vrdth 
721 rlifcqrnqlsl aeseaslgnt nipasdhyvg pngfkslaqe atrlaeshve sasnrneqltr 
781 etedyskqal slvrkalheg vgsgsgspdg awqglvekl ektkslaqql treatqaeie 
841 adrsyqhslr lldsvsrlqg vsdqsfqvee akrikqkads Istlvtrhmd efkxtqknlg 
901 nwkeeaqqll qngksgreks dqllsranla ksraqealsm gnatfyeves ilknlrefdl 
961 qvdnrkaeae eamkrlsyis qkvsdasdkt qqaeralgsa aadaqrakng agealeisse 
1021 ieqeigslnl eanvbadgal amekglaslk semrevegel erkelefdtn mdavqmvite 
1081 aqkvdtrakn agvfciqdtln fcldgllhlmg m 

16. Degenerate Human integrin beta4 subunit cds 

atg gca gCg cca cgc ccc age cca tgg gec agg ctg etc chg gca gec fctg afcc age gtc 
age etc tct ggg ace fctg aac cgc tgc aag aag gec cca gtg aag age tgc acg gag tgt 
gtc cgt gtg gat aag gac tgc gec tac tgc aca gac gag atg ttc agg 

gaccggcgctgcaacacccaggcggagctgctggccgcgggctgccagcgggagagcatcgtggtcatggagagcagcttc 

caaatcacagaggagacccagattgacaccaccctgcggcgcagccagatgtccccceaaggcctgcgggtccgtctgcgg 
cccggtgaggagcggcattttgagctggaggtgtttgagccactggagagccccgtggacctgtacatcctcatggacttc 
tccaactccatgtccgatgatctgcacaacctcaagaagatggggcagaacctggctcgggtcctgagccagctcaccagc 
gactacactattggatttggcaagt t tgtggacaaagtcagcgfccccgcagacggacatgaggcc tgagaagctgaaggag 
ccttggcccaacagtgacccccccttctccfctcaagaacgtcatcagcctgacagaagatgtggatgagt teeggaataaa 

ctgcagggagagcggatctcaggcaacctggatgctcctgagggcggcttcgatgccatcctgcagacagctgtgtgcacg 
agggacattggctggcgcccggacagcacccacctgctggtcttctccaccgagtcagccttccactatgaggcfcgatggc 
gccaacgtgctggctggcatcatgagccgcaacgatgaacggtgccacctggacaccacgggcacctacacccagtacagg 
acacaggactacccgtcggtgcccaccctggtgcgcctgctcgccaagcacaacatcatccccatct t tgctgtcaccaac 
tactcctatagctactacgagaagcttcacacctatttccctgtctcctcactgggggtgctgcaggaggactcgtccaac 

atcgtggagctgctggaggaggcct tcaafccggatccgctccaacc tggacatccgggcccfcagacagcccccgaggcctt 
cggacagaggtcaccfcccaagatgttccagaagacgaggactgggtcctt tcacatccggcggggggaagtgggtatatac 
caggtgcagctgcgggcccttgagcacgtggatgggacgeacgtgtgccagctgccggaggaccagaagggcaacatccat 
ctgaaaccttcc ttctccgacggcct caagatggacgcgggca teat ctg t gat gtg tgcacctgcgagctgcaaaaagag 
gtgcggtcagctcgctgcagcttcaacggagacttcgtgtgcggacagtgtgtgtgcagcgagggctggagtggccagacc 

tgcaactgctccaccggctctctgagtgacattcagccctgcctgcgggagggcgaggacaagccgtgctccggccgtggg 
gagtgccagtgcgggcactgtgtgtgctacggcgaaggccgctacgagggtcagtfcctgcgagtatgacaacttccagtgt 
ccccgcacttccgggttcctctgcaatgaccgaggacgctgctccatgggccagtgtgtgtgtgagcctggttggacaggc 
ccaagctgtgactgtcccctcagcaatgccacctgcatcgacagcaatgggggcatctgtaatggacgtggccactgtgag 
tgtggccgctgccactgccaccagcagtcgctctacacggacaccatctgcgagatcaactactcggcgtccacccgggcc 

tctgcgaggacctacgctcctgcgtgcagtgccaggcgzggggcaccggcgagaagaaggggcgcacgtgtgaggaatgca 
acttcaaggtcaagatggtggacgagcttaagagaggcgaggaggtggtggtgcgctgctccttccgggacgaggatgacg 
actgcacctacagctacaccatggaaggtgacggcgcccctgggcccaacagcactgtcetggtgcacaagaagaagggac 
tgc cc t ceggge t cc t t cfcggtggc t ca t cc ccc t gc t cc t cc t cc t cc tgc eg ctcctggccc tec tac tgc tgc t a tgc 
tggaagtactgtgcctgctgcaaggcctgcc tggcactzctcccgtgctgcaaccgaggtcacatcgtgggctttaaggaa 

gaccact aca tgc tgcgggagaacctga fcgg ccfcctgaccacttggacacgccca tgc tgcgcagcgggaacctcaagggc 
cgtgacgtggtccgctggaaggtcaccaacaacatgcagcggcctggctttgccactcatgccgccagcatcaaccccaca 
gage tgg tgccctacgggc tgt ccttgcgcctggc ccgcctttgcac cgagaacc tgc tgaagcctcacactcgggag tgc 
gcccagctgcgccaggaggtggaggagaacctgaacgaggtctacaggcagatctccggtgtacacaagctccagcagacc 
aagttccggcagcagcccaatgccgggaaaaagcaagaccacaccattgtggacacagtgctgatgccgccccgctcggcc 

aagccggccctgctgaagcttacagagaagcaggtggaacagagggccttccacgacctcaaggtgccccccggctactac 
accctcactgcagaccaggacgcccggggcatggtggagttccaggagggcgtggagctggtggacctacgggtgcccctc 
tttatccggcctgaggatgacgacgagaagcagctgctggtggaggccatcgacgtgcccgcaggcactgccaccctcggc 
cgccgcctggtaaacatcaccatcatcaaggagcaagccagagacgtggfcgtcct ttgagcagcctcagt t cteggt cage 
cgcggggaccaggtggcccgcatccctgtcatccggcgtgtcctggacggcgggaagtcccaggtctcctaccgcacacag 

gatggcaccgcgcagggcaaccgggactacatccccgtggagggtgagctgctgttccagccfcggggaggcctggaaagag 
ctgcaggtgaagc t cctggagctgcaagaagt tgact ccctcctgcggggccgccaggtccgccgtttccaegtccagctc 
agcaaccctaagt fctggggcccacctgggccagccccactccaccaccatcatcatcagggacccagatgaactggaccgg 
agct tcacgagtcagatgt fcgtcatcacagccaccccctcacggcgacctgggcgccccgcagaaccccaatgctaaggcc 
gctgggtccaggaagatccatttcaactggc tgcccccttctggcaagccaatggggfcacagggtaaagfcactgga t tcag 

ggtgactccgaatccgaagcccacctgctcgacagcaaggtgccctcagtggagctcaccaacctgtacccgtattgcgac 
tatgagatgaaggtgtgcgcctacggggctcagggcgagggaccctacagctccctggtgtcctgccgcacccaccaggaa 
gtgcccagcgagccagggcgtctggccttcaatgtcgtctcctccacggtgacccagctgagctgggctgagccggctgag 
accaacggtgagatcacagcctacgaggtctgctatggcctggtcaacgatgacaaccgacctattgggcccatgaagaaa 
gtgctggttgacaaccctaagaaccggatgctgcttattgagaaccttcgggagtcccagccctaccgctacacggtgaag 

gcgcgcaacggggccggctgggggcctgagcgggaggccatcafccaacctggccacccagcccaagaggcccatgtccatc 
cccatcafcccctgacatccctatcgtggacgcccagagcggggaggactacgacagcttccttatgtacagcgatgacgtt 
ctacgctetccatcgggcagccagaggcccagcgtctccgatgacactggctgcggctggaagttcgagcccctgctgggg 
gaggagctggacctgcggcgcgtcacgtggcggctgcccccggagctcatcccgcgcctgtcggccagcagcgggcgctcc 
tccgacgccgaggcgccccacgggcccccggacgacggcggcgcgggcgggaagggcggcagcctgccccgcagtgcgaca 

cccgggccccccggagagcacctggtgaatggccggatggactttgccttcccgggcagcaccaactccctgcacaggatg 
accacgaccagtgctgctgcctatggcacccacctgagcccacacgtgccccaccgcgtgctaagcacatcctccaccctc 
acacgggacbacaacbcacbgacccgcbcagaacacbcacacbcgaccacacbgccgagggacbacbccacccbcaccbcc 

gbcbccbcccacggccbcccbcccabcbgggaacacgggaggagcaggcttccgcbgtcctgggeccbggggbcccggagb 

cgggcbcagabgaaagggttccccccttccaggggcccacgagactctataatccbggcbgggaggccagcagcgcccbcc 

tggggcccagactctcgccbgacbgcbggbgtgcccgacacgcccacccgccbggtgfcbctcbgcccbggggcccacabcb 

ctcagagtgagctggcaggagccgcggtgcgagcggccgctgcagggctacagtgbggagbaccagcbgcbgaacggcggb 

gagcbgcabcggcbcaacabccccaacccbgcccagacctcggbggbggtggaagaccbccbgcccaaccacbccbcicgbg 

bbccgcgbgcgggcccagagccaggaaggcbggggcccagagcgbgagggbgtcabcaccabbgaabcccaggbgcacccg 

cagagcccactgbgtccccbgccaggcbccgcctbcacbbbgagcacbcccagtgccccaggcccgcbggbgbbcacbgcc 

cbgagcccagacbcgctgcagcbgagctgggagcggccacggaggcccaatggggabafccgbcggcbaccbggbgaccbgb 

gagatggcccaaggaggaggbccagccaccgcattccgqgbggabggagacagccccgagagccggcbgaccgbgccgggc 

cbcagcgagaacgbgcccbacaagbbcaaggbgcaggccaggaccacbgagggcbbcgggccagagcgcgagggcabcabc 

accabagagbcccaggabggaggbcccbbcccgcagcbgggcagccgbgccgggcbcbbccagcacccgcbgcaaagcgag 

bacagcagcabcbccaccacccacaccagcgccaccgagcccbbccbagbgggtccgacccbgggggcccagcaccbggag 

gcaggcggcbcccbcacccggcabgtgacccaggagbbbgbgagccggacacbgaccaccagcggaacccbtagcacccac 
abggaccaacagbbcbtccaaacbbga 

17. SEQ ID NO:17 Human beta4 integrin protein sequence Variant V at 34 to I 

1 magprpspwa rlllaalisv slsgblnrck kaplkscbec vrvdkdcayc bdemfrdrrc 
61 nbqaellaag cqresiwme ssfgibeebg idbfclrrsqm spqglrvrlr pgeerhfele 
121 vfeplespvd lyilmdfsns msddldnlkk mgqnlarvls qlbsdytigf gkfvdkvsvp 
181 qbdmrpeklk epwpnsdppf sfknvislte dvdefrnklq gerisgnlda peggfdailq 
241 bavebrdigw rpdsthllvf stesafhyea dganvlag.im srnderchld bbgbybgyrb 

301 qdypsvpblv rllakhniip ifavtnysys yyeklhbyfp vsslgvlqed ssnivellee 
361 afnrirsnld iraldsprgl rtevbskmfq kbrbgsfhir rgevgiyqvq Iralehvdgt 

421 hvcqlpedqk gnihlkpsfs dglkmdagii cdvcbcelqk evrsarcsfn gdfvcgqcvc 

481 segwsgqbcn csbgslsdiq pclregedkp csgrgecqcg hevcygegry egqfceydnf 
541 qcprbsgflc ndrgrcsmgq cvcepgwbgp scdcplsnab cidsnggicn grghcecgrc 
601 hchqqslybd biceinysas brasarbyap acsarrgapa rrrgarvrna bsrsrwwbsl 
661 reairwwcaa psgbrmttap babpwkvbap Igpfcalswch rrrdcppgsf wwliplllll 

721 Ipllalllll cwkycaccka clallpccnr ghmvgfkedh ymlrenlmas dhldbpmlrs 
781 gnlkgrdwr wkvtnnmqxp gfabhaasin ptelvpygls lrlarlcben llkpdbreca 
841 qlrqeveenl nevyrqisgv hklqqbkfrq qpnagkkqdh bivdtvlmap rsakpallkl 
901 bekqveqraf hdlkvapgyy blbadqdarg mvefqegvel vdvrvplfir pedddekqll 
961 veaidvpagb ablgrrlvni biikeqardv vsfeqpefsv srgdqvarip virrvldggk 

1021 sqvsyrbqdg baqgnrdyip vegellfqpg eawkelqvkl lelqevdsll rgrqvrrfhv 
1081 qlsnpkfgah Igqphsbtii irdpdeldrs ftsqmlssqp pphgdlgapq npnakaagsr 
1141 kihfnwlpps gkpmgyrvky wiqgdsesea hlldskvpsv elbnlypycd yemkvcayga 
1201 qgegpysslv scrbhqevps epgrlafnvv ssbvbqlswa epaebngeit ayevcyglvn 
1261 ddnrpigpmk kvlvdnpknr mllienlres qpyrybvkar ngagwgpere aiinlabqpk 

1321 rpmsapiapd xpivdaqsge dydsflmysd dvlrspsgsq rpsvsddtgc gwkfepllge 
1381 eldlrrvbwr Ippeliprls assgrssdae aphgppddgg aggkggslpr sabpgppgeh 
1441 lvngrmdfaf pgsbnslhrm bbbsaaaygb alsphvphrv lsbsstltrd ynslbrsehs 
1501 hsbtlprdys tltsvsshgl ppiwehgrsr Iplswalgsr sraqmkgfpp srgprdsiil 
1561 agrpaapswg pdsrlbagvp dbpbrlvfsa lgpbslrvsw qeprcerplq gysveyqlln 

1621 ggelhrlnip npaqbsvwe dllpnhsyvf rvragsqegw greregvibi esqvhpqspl
1681 cplpgsaftl sbpsapgplv ftalspdslq Iswerprrpn gdivgylvbc emaqgggpat 
1741 afrvdedspe srlbvpglse nvpykfkvqa rbbegfgper egiitiesqd ggpfpqlgsr 
1801 aglfqhplqs eyssisbthb sabepflvgp blgaqhleag gslbrhvbqe fvsrtlttsg 
1861 tlsthradqqf fqb 

18. SEQ ID NO: 18 Human integrin beta4 subunit cds Variant at 34 V to I:

afcg gca ggg cca cgc ccc age cca bgg gec agg cbg cbc cbg gca gec bbg abc age gtc 
age cbc beb ggg acc btg aac cgc bgc aag aag gec cca AbA aag age bec acg gag bgfc 
gbc egb gbg gab aag gac tgc gec tac bgc aca gac gag abg bbc agg 

gaccggcgctgcaacacccaggcggagctgctggccgcgggcbgccagcgggagagcabcgbggtcabggagagcagcbbc 
caaabcacagaggagacccagabbgacaccacccbgcggcgcagccagabgbccccccaaggccbgcgggbccgtcbgcgg
cccggbgaggagcggcabtbtgagctggaggbgbbtgagccacbggagagccccgbggaccbgbacabccteabggaclbc 
bccaacbccabgbccgabgabcbggacaaccbcaagaagabggggcagaaccbggcbcgggbccbgagccagcbcaccagc 
gacbacacbabbggabbbggcaagbbbgbggacaaagbcagcgtcccgcagacggacabgaggcctcagaagcbgaaggag 
cctbggcccaacagbgacccccccbtcbccbtcaagaacgbcatcagccbgacagaagabgbggabcagbbccggaabaaa 
cbgcagggagagcggabcbcaggcaaccbggabgcbccbgagggcggcbbcgabgccabcctgcagacagcbgbgbgcacg
a 999 a cabfeggctggcgcccggacagcacccacctgctggtcbbctccaccgagbcagccbbccacbabgaggcbgabggc 
gccaacgbgctggcbggcatcatgagccgeaacgafcgaacggbgccaccbggacaccacgggcaccbacacccagbacagg 
acacaggacbacccgbcggbgcccacccfcggbgcgcctgetcgccaagcacaacabcabccccabcbbbgcbgbcaceaae 
bactccbatagctacfcacgagaagcbbeacaccbattbcccbgtcbccbcacbgggggbgcbgcaggaggacbcgtccaac 
abcgtggagctgcbggaggaggccbbcaabcggabccgctccaaccbggacatccgggcccbagacagcccccgaggccbt
cggacagaggtcaccbccaagabgbbccagaagacgaggactgggtccbbt cacabccggcggggggaagbgggbababac
caggbgcagctgcgggcccbbgagcacgbggabgggacgcacgbgbgccagctgccggaggaccagaagggcaacatccab 
cbgaaacc t tccttcbccgacggcctcaagatggacgcgggcatcabcbgbgatgbgbgcaccbgcgagcbgcaaaaagag 
gbgcggbcagczcgctgcagctbcaacggagacbbcgbgtgcggacagtgtgtgtgcagcgagggctggagbggccagacc 
tgcaactgctccaccggctctctgagtgacattcagccctgcctgcgggagggcgaggacaagccgtgcbccggccgtggg
gagtgccagtgcgggcacbgbgbgtgcbacggcgaaggccgcbacgagggtcagttcbgcgagtatgacaacbbccagtgt 

ccccgcactt ccgggtfccctctgcaatgaccgaggacgcfcgctccabgggccagfcgtgtgtgtgagccbggt tggacaggc 
ccaagcbgbgactgbccccbcagcaatgccaccbgcabcgacagcaabgggggcabcbgbaabggacgtggccacbgbgag 
bgbggccgcbgccacbgccaccagcagtcgcbcbacacgqacaccabcbgcgagabcaacbacbcggcgbccacccgggcc 
tcbgcgaggacctacgcbcctgcgbgcagbgccaggcgtggggcaccggcgagaagaaggggcgcacgbgtgaggaatgca
acbbcaaggbcaagabggbggacgagcbbaagagaggcgaggaggbggbggtgcgctgcbcctbccgggacgaggabgacg 
actgcaccbacagctacaccatggaaggtgacggcgcccctgggcccaacagcactgbcctggbgcacaagaagaagggac 
bgcccbccgggctccttcbggbggcbcabcccccbgcbccbectccbccbgccgcbccbggcccbgctactgcbgctabgc 
bggaagtacbgzgccbgcbgcaaggccbgccbggcacbbcbcccgbgcbgcaaccgaggbcacabgg-gggcbbbaaggaa 

gaccacbacabgctgcgggagaaccbgabggccbcfcgaccacbbggacacgcccabgcbgcgcagcgggaaccbcaagggc
cgbgacgtggtccgcbggaaggbcaccaacaacabgcagcgqcctggcbtbgccacbcatgccgccagcatcaaccccaca 
gagcbggbgccctacgggcbgbccbbgcgcctggcccgccbbbgcaccgagaaccbgcbgaagcctgacacbcgggagbgc 
gcccagctgcgccaggaggbggaggagaacctgaacgaggtctacaggcagatctccggtgtacacaagctccagcagacc 
aagtbccggcagcagcccaatgccgggaaaaagcaagaccacaccatbgfcggacacagbgctgabggcgccccgcbcggcc 

aagccggcccbgctgaagcbtacagagaagcaggbggaacagagggccbtccacgaccbcaaggbggcccccggctacbac
acccbcacbgcagaccaggacgcccggggcatggtggagbbccaggagggcgbggagcbggbggacgtacgggbgccccbc 
bbbabccggcctgaggabgacgacgagaagcagctgcbggbggaggccatcgacgbgcccgcaggcac tgccaccctcggc 
cgccgccbggbaaacatcaccabcatcaaggagcaagccagagacgbggbgbcctbbgagcagcc t gag b be b egg t cage 
egcggggaccaggbggcccgcabcccbgtcabccggcgbgbcc tggacggcgggaagtcccaggbctcctaccgcacacag 

gatggcaccgcgcagggcaaccgggactacatccccgtggagggt gage tgc tgtbccagccbggggaggcctggaaagac
ctgcaggtgaagctcctggagctgcaagaagt bgactcccbcctgcggggccgccaggtccgccgt t t ccacgtccagcbc 
agcaacccbaag t b tggggcccaccbgggccagccccacbccaccaccatcabcabcagggacccagatgaacbggaccgg 
agcbbcacgagtcagabgbbgbcabcacagccacccccbcacggcgacctgggegccccgcagaaccccaabgcbaaggcc 
gcbgggbccaggaagatccabbbcaactggctgcccccbbcbggcaagccaabggggtacagggtaaagtacbggabbcac 

ggbgacbccgaat ccgaagcccaccbgcbcgacagcaaggbgccctcagbggagcbcaccaacc tgtacccgbabbgcgac
babgagatgaaggtgtgcgccbacggggcbcagggcgagggacccbacagctcccbggbgtccbgccgcacccaccaggaa 
gtgcccagcgagccagggcgbcbggccbtcaatgtcgbcbcctccacggtgacccagcbgagctgggctgagccggctgac 
accaacggtgagatcacagccbacgaggbcbgcbabggccbggbcaacgatgacaaccgaccbabbgggcccabgaagaaa 
gbgc tggbtgacaacccbaagaaccggabgcfcgcbbatbgagaacctbcgggagtcccagcccbaccgctacacggbgaag 

gcgcgcaacggggccggctgggggccbgagcgggaggccabcafccaacctggccacccagcccaagaggcccabgtccatc
cccabcabccctgacatcccbabcgbggacgcccagagcggggaggacbacgacagcbbccbbabgtacagcgabgacgbt 
cbacgcbcbcca~ cgggcagccagaggcccagcgbcbccgabgacacbggcbgcggcbggaagbbcgagccccbgctgggg 
gaggagcbggaccbgcggcgcgbcacgtggcggctgcccccggagcbcatcccgcgccbgbcggccagcagcgggcgcbcc 
bccgacgccgaggcgccccacgggcccccggacgacggcggcgcgggcgggaagggcggcagccbgccccgcagbgcgaca 

cccgggccccccggagagcaccbggbgaatggccggabggacbt bgceb tcccgggcagcaccaac tccctgcacaggabg
accacgaccagtgctgcbgccbabggcacccaccbgagcccacacgtgccccaccgcgbgcbaagcacabccbccaccctc 
acacgggactacaactcactgacccgctcagaacachcacactcgaccacactgccgagggactactccaccctcacctcc 
gbcbccbcccacggccbcccbcccabcbgggaacacgggaggagcaggcbbccgcbgbccbgggcccbggggbcccggagb 
cgggcbcagatgaaagggbtcccccctfcccaggggcccscgagactcbabaatccbggcbgggaggccagcagcgcccbcc 

bggggcccagactctcgGcbgacbgcbggbgbgcccgacacgcccacccgccbggbgbbcbcbgcccbggggcccacabcb
cbcagagbgagctggcaggagccgcggtgcgagcggccgcbgcagggctacagtgtggagtaccagctgctgaacggcggb 
gagcbgcatcggctcaacabccccaaccctgcccagaccbcggtggtggtggaagacctcctgcccaaccacbccbacgtg 
tbccgcgbgcgggcccagagccaggaaggcbggggccgagagcgbgagggbgbcabcaccatbgaabcccaggbgcacccg 
cagagcccactgtgbccccbgccaggctccgccttcacbbbgagcacfccccagtgccccaggcccgctggbgtbcacbgcc 

cbgagcccagactcgctgcagcbgagcbgggagcggccacggaggcccaabggggababcgtcggcbacctggbgaccbgb
gagabggcccaaggaggaggbccagccaccgcatbccgggbggabggagacagccccgagagccggctgaccgbgccgggc 
cbcagcgagaacgtgcccbacaagbbcaaggbgcaggccaggaccactgagggctbcgggccagagcgcgagggcabcabc 
sccabagagtcccaggatggaggtccctbcccgcagctgggcagccgtgccgggctctbccagcacccgcbgcaaagcgag 
tacagcagcatctccaccacccacaccagcgccaccgagcccttcctagtgggtccgaccctgggggcccagcacctggag 

gcaggcggctccctcacccggcabgbgacccaggagtttgbgagccggacactgaccaccagcggaacccbtagcacccac
abggaccaacagbbcbbccaaacbbga 

19. SEQ ID NO: 19 Degenerate Human integrin beta4 subunit cds variant at 34 V to I:

abg gca ggg cca cgG ccc age cca tgg gec agg ctg cbc ctg gca gee bbg ate age gbc 
age etc fccfc ggg acc tbg aac cgc tgc aag aag gec cca At A aag age tgc acg gag tgb
gbc egb gbg gab aag gac tgc gec bac bgc aca gac gag abg ttc agg 

gaccggcgctgcaacacccaggcggagcbgcbggccgcgggctgccagcgggagagcabcgbggbcatggagagcagcbbc 
caaabcacagaggagacccagabbgacaccaccctgcggegcagccagabgtccccccaaggcctgcgggtccgfcctgcgg 
cccggtgaggagcggcabbbbgagcbggaggbgtttgagccacbggagagccccgbggaccbgbacatcctcabggacbbc 
bccaacbccabgtccgabgabcbggacaaccbcaagaagabggggcagaaccbggctcgggbcctgagccagcbcaccagc
gactacactabtggafctbggcaagtbbgtggacaaagtcagcgbcccgcagacggacatgaggcctgagaagctgaaggag
ccbbggcccaacagbgacccccccbbcbccbbcaagaacgbcabcagccbgacagaagabgbggabgagb tccggaabaaa
cbgcagggagagcggabcbcaggcaaccbggabgcbccbgagggcggcbbcgabgccabccbgcagacagcbgbgbgcacg 
agggacabbggcbggcgcccggcicagcacccaccbgctggbctbctccaccgagbcagcctbccacbabgaggcbgatggc 
gccaacgbgcbggctggcabcabgagccgcaacgabgaacggtgccaccbggacaccacgggcaccbacacccagtacagg 
acacaggacbacccgbcggbgcccaccctggbgcgccbgcbcgccaagcacaacab cat ccccabctbtgcbgb caeca ac
bacbccbabagctacbacgagaagcttcacacctafcfctccctgbcbccbcacbgggggtgcfcgcaggaggacbcgbccaac 
abcgbggagcbgcbggaggaggccbbcaabcggabccgcbccaaccbggacabccgggcccbagacagcccccgaggccbb 
cggacagaggfccacctccaagabgbbccagaagacgaggacbgggbccbbbcacabccggcggggggaagbgggbababac 
caggbgcagcbgcgggcccbbgagcacgbggabgggacgcacgbgbgccagcbgccggaggaccagaagggcaacabccab 

cbgaaaccbbccbtcbccgacggccbcaagabggacgcgggcabcabctgtgatgbgbgcacctgcgagcbgcaaaaagag
gtgcggtcagcbcgctgcagcttcaacggagac ttcgtgtgcggacagbgtgtgbgcagcgagggcbggagbggccagacc 
bgcaacbgcbccaccggcbctctgagtgacatbcagcccbgccbgcgggagggcgaggacaagccgbgcbccggccgfcggg 
gagbgccagbgcgggcacbgbgbgbgcbacggcgaaggccgcbacgagggbcagbbcbgcgagbabgacaacbbccagtgb 
ccccgcacbbccgggbbccbcbgcaabgaccgaggacgcbgcbccabgggccagbgbgbgtgtgagccbggbbggacaggc 

ccaagcbgbgacbgtccccbcagcaabgccacc tgcatcgacagcaabgggggcabcbgbaabggacgbggccacbgtgag
tgbggccgcbgccacbgccaccagcagbcgcbcbacacggacaccabcbgcgagabcaacbacbcggcgbccacccgggcc 
bcbgcgaggacctacgctcctgcgbgcagbgccaggcgtggggcaccggcgagaagaaggggcgcacgbgbgaggaatgca 
acttcaaggtcaagatggtggacgagctbaagagaggcgaggaggtggtggbgcgcbgcfcccttccgggacgaggabgacg 
acbgcaccbacagctacaccafcggaaggbgacggcgccccbgggcccaacagcacbgtccbggbgcacaagaagaagggac 

bgcccbccgggcbcctbcbggbggcbcabccccctgctccbccbccbccbgccgcbccbggcccbgcbacbgcbgctafcgc
bggaagbactgbgcctgctgcaaggcctgccbggcacbtcbcccgtgcbgcaaccgaggtcacabggtgggcbtbaaggaa 
gaccacbacabgcbgcgggagaaccbgabggcctcbgaccacbbggacacgcccabgcfcgcgcagcgggaacctcaagggc 
cgbgacgbggtccgctggaaggb caeca acaacabgcagcggcctggcbbbgccacbcabgccgccagcabcaaccccaca 
gagcbggbgccctacgggcbgtccbbgcgccbggcccgccbbtgcaccgagaaccbgcbgaagccbgacacbcgggagfcgc 

gcccagctgcgceaggaggbggaggagaacctgaacgaggtctacaggcagatctccggtgtacacaagctccagcagacc
aagbbccggcagcagcccaatgccgggaaaaagcaagaccacaccabbgbggacacagtgcbgatggcgccccgcbcggcc 
aagccggccctgcbgaagcbbacagagaagcaggtggaacagagggccbbccacgaccbcaaggtggcccccggctactac 
acccbcacbgcagaccaggacgcccggggcabggbggagbbccaggagggcgbggagcbggbggacgbacgggbgcccctc 
bbbatccggcctgaggatgacgacgagaagcagctgcbggbggaggccabcgacgtgcccgcaggcacbgccaccctcggc 

cgccgcctggbaaacatcaccabcabcaaggagcaagccagagacgbggbgbccbbtgagcagcctgagbbcbcggbcagc
cgcggggaccaggbggcccgcabcccbgbcabccggcgbgtccbggacggcgggaagbcccaggbcbccbaccgcacacag 
gabggcaccgcgcagggcaaccgggacbacabccccgtggagggtgagcbgctgbtccagcctggggaggcctggaaagag 
cbgcaggtgaagcbcctggagctgcaagaagbbgactcccbccbgcggggccgccaggtccgccgtbtccacgtccagcbc 
agcaacccbaagbbbggggcccaccbgggccagccccactccaccaccabcabcabcagggacccagabgaactggaccgg 

agcbbcacgagbcagatgbbgbcabcacagccaccccctcacggcgaccbgggcgccccgcagaaccccaabgcbaaggcc
gctgggbccaggaagabccabtbcaacbggcbgcccccttctggcaagccaabggggbacagggtaaagtacbggabtcag 
ggtgactccgaatccgaagcccacctgctcgacagcaaggtgccctcagbggagctcaccaacctgtacccgtabtgcgac 
tabgagabgaaggtgtgcgcctacggggctcagggcgagggacccbacagctccctggfcgtccbgccgcacccaccaggaa 
gtgcccagcgagccagggcgtctggcctbcaatgtcgtctcctccacggtgacccagctgagctgggctgagccggctgag 

accaacggtgagatcacagcctacgaggtctgc tatggee fcggtcaacgatgacaaccgacctafctgggcccatgaagaaa
gtgctggttgacaaccctaagaaccggatgctgcttattgagaaccttcgggagtcccagccctaccgctacacggtgaag 
gcgcgcaacggggccggcbgggggccbgagcgggaggccabcabcaaccbggccacccagcccaagaggcccabgtccat c 
cccatcatcccbgacatcccbatcgbggacgcccagagcggggaggacbacgacagcb tccbbabgtacagcgabgacgt b 
ctacgcbcbccabcgggcagccagaggcccagcgtctccgabgacactggcbgcggcbggaagbbcgagcccctgcbgggg 

gaggagctggacctgcggcgegtcacgtggcggctgcccccggagctcatcccgcgccbgbcggccagcagcgggcgctcc
tccgacgccgaggcgccccacgggcccccggacgacggcggcgcgggcgggaagggcggcagcctgccccgcagtgcgaca 
cccgggccccccggagagcaectggtgaatggccggatggacttfcgccttcccgggcagcaccaactccctgcacaggatg 
accacgaccagtgctgctgcctatggcacccacctgagcccacacgtgccccaccgcgtgctaagcacatccbccaccctc 
acacgggactacaactcactgacccgcbcagaacactcacactcgaccacacbgccgagggactacbccaccctcacctcc 

gbcbccbcccacggcctcccbcccabctgggaacacgggaggagcaggcbbccgcbgtccbgggccctggggbcccggagt
cgggctcagabgaaagggbbccccccbbccaggggcccacgagacbctabaabccbggcbgggaggccagcagcgcccbcc 
tggggcccagacbcbcgcctgacbgcbggtgbgcccgacacgcccacccgcctggbgbbctctgcccbggggcccacatct 
ctcagagtgagctggcaggagccgcggtgcgagcggccgctgcagggctacagtgtggagcaccagcbgctgaacggcggt 
gage tgeategge tea acatccccaaccctgcccagacct egg tggtggtggaagacctcctgcccaaccactcctacgtg 

btccgcgbgcgggcccagagecaggaaggcbggggccgagagcgtgagggbgbcat caeca tbgaabcccaggtgcacccg
cagagcccactgtgtccccbgccaggctccgcctfccacbbbgagcacbcccagbgccccaggcccgcbggbgbbcacbgcc 
ctgagcccagactcgctgcagcbgagctgggagcggccacggaggcccaatggggababcgbcggcbacctggtgacctgt 
gagabggcccaaggaggaggbccagccaccgcab tccgggbggabggagacagccccgagagccggcbgaccgtgccgggc 
ctcagcgagaacgtgcccbaeaagttcaaggtgcaggccaggaccactgagggcbbcgggccagagcgcgagggcatcatc 

accatagagtcccaggatggaggtcccttcccgcagctgggcagccgtgccgggctcttccagcaeccgetgcaaagcgag
tacagcagcatctccaccacccacaccagcgccaccgagcccttcctagfcgggtccgaccccgggggcccagcacctggag 
gcaggcggcbcccbcacccggcabgtgacccaggagttbgbgagccggacacbgaccaccagcggaacccbbagcacccac 
abggaccaacagbbcb becaaactbga

What is claimed is:

1. A method of reducing the proliferation of a cancer cell which comprises inhibiting
ligand binding to an integrin receptor on the cancer cell, wherein the integrin receptor comprises an 
integrin. 

2. The method of claim 1, wherein the integrin receptor comprises integrin B4. 

3. The method of claim 1, wherein the integrin receptor comprises integrin A6. 

4. The method of claim 1, wherein the ligand that binds to the integrin receptor is 
laminin5. 

5. The method of claim 1, wherein the integrin receptor is A6B4. 

6. The method of claim 1, wherein the ligand comprises laminin5. 

7. The method of claim 1, wherein the ligand comprises the gamma-2 subunit of laminin5.

8. The method of claim 1, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting an A6 integrin with a composition that inhibits ligand binding.

9. The method of claim 1, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a B4 integrin with a composition that inhibits ligand binding. 

10. The method of claim 1, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a laminin5 with a composition that inhibits ligand binding. 

11. The method of claim 1, wherein inhibiting ligand binding to an integrin receptor 
comprises contacting a gamma-2 subunit with a composition that inhibits ligand binding. 

12. A method of reducing the proliferation of a cancer cell which comprises reducing 
integrin-integrin interaction, integrin receptor clustering interaction, or integrin-non-integrin protein
interaction. 

13. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting an
A6 integrin with a composition that inhibits an interaction between the B4 integrin and another
integrin or protein molecule. 

14. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting a 
B4 integrin with a composition that inhibits the interaction between the B4 integrin and another 
integrin or protein molecule.

15. The method of claim 12, wherein reducing integrin-integrin interaction, integrin
receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting a
laminin5 with a composition that inhibits ligand binding.

16. The method of claim 12, wherein reducing integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction comprises contacting a 
gamma2 subunit with a composition that inhibits ligand binding. 

17. The method of claim 12, wherein the non-integrin protein comprises a growth factor 
receptor. 

18. The method of claim 12, wherein the non-integrin protein comprises a hemidesmosome
junction.

19. The method of claim 12, wherein the non-integrin protein comprises an SH2 domain.

20. The method of claim 19, wherein the non-integrin protein comprises a Shc protein.

21. The method of claim 12, wherein the non-integrin protein comprises an IRS-1 protein.

22. The method of claim 12, wherein the non-integrin protein comprises an IRS-2 protein.

23. A method of reducing the proliferation of a cancer cell which comprises reducing the 
production of an integrin by the cancer cell. 

24. The method of claim 23, wherein the production of an integrin is reduced by inhibiting 
signaling leading to induction of expression of an integrin. 

25. The method of claim 23, wherein reducing the production of an integrin comprises 
inhibiting alpha6 production. 

26. The method of claim 25, wherein inhibiting alpha6 production further comprises using 
antisense molecules to alpha6 mRNA. 

27. The method of claim 23, wherein reducing the production of an integrin comprises 
inhibiting beta4 production. 

28. The method of claim 27, wherein inhibiting beta4 production further comprises using 
antisense molecules to beta4 mRNA.

29. A method of reducing the proliferation of a cancer cell which comprises reducing the 
production of an integrin receptor ligand by the cancer cell. 

30. The method of claim 29, wherein reducing the production of an integrin receptor 
ligand comprises inhibiting gamma2 production. 

31. The method of claim 29, wherein reducing the production of an integrin receptor 
ligand comprises inhibiting laminin production. 

32. The method of claim 29, wherein reducing the production of an integrin receptor
ligand comprises inhibiting laminin5 production.

33. A method of reducing the proliferation of a cancer cell which comprises interfering 
with an integrin signaling pathway. 

34. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting an A6 integrin in the cell with a composition that inhibits ligand binding. 

35. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting a B4 integrin with a composition that inhibits ligand binding. 

36. The method of claim 33, wherein interfering with an integrin signaling pathway
comprises contacting a laminin5 with a composition that inhibits ligand binding. 

37. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting a gamma2 subunit with a composition that inhibits ligand binding. 

38. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting the cancer cell with a molecule that interferes with at least one of talin, 
paxillin, vinculin, a CAS family protein, CRK, NCK, FAK, ILK, Src, Fyn, Shc, Grb-2, Guanine
nucleotide exchange factors, SOS, DOCK 180, Vav, Syk, PI-3 kinase, AKT, Bad, Bid, Caspase 9,
Cdc42, PAK, Rac, Rho, Rho kinase, Ras, Caveolin, Tetraspan, Receptor-type protein tyrosine
phosphatase, SHP-2, Alpha-actinin, Filamin, Cytohesin, Beta3-endonexin, ICAP-1, RACK-1, CIB,
actin, receptor tyrosine kinase, IRS-1 or IRS-2.

39. The method of claim 33, wherein interfering with an integrin signaling pathway 
comprises contacting the cancer cell with an agent that interferes with post-translational 
modification of integrins. 

40. The method of claim 39, wherein the post-translational modification is
glycosylation or phosphorylation. 

41. The method of claims 1, 12, 23, 29, 33, wherein the integrin comprises an A6 integrin. 

42. The method of claims 1, 12, 23, 29, 33, wherein the integrin comprises a B4 integrin. 

43. The method of claims 1, 12, 23, 29, 33, further comprising reducing a laminin5-integrin
interaction.

44. The method of claims 1, 12, 23, 29, 33, further comprising reducing a laminin5 
gamma2 integrin interaction. 

45. The method of claims 1, 12, 23, 29, 33, wherein reducing the proliferation of the 
cancer cells is selective. 

46. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell is not an MDA-MB-435
cell.

47. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell is not an HMT-3522
cell.

48. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell is not an RKO colon 
carcinoma line. 

49. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell does not express 
exogenous B4 integrin. 

50. The method of claims 1, 12, 23, 29, 33, further comprising contacting the cancer cells 
with a small molecule, peptide, peptide mimetic, or oligonucleotide or synthetic analog thereof. 

51. The method of claim 50, wherein the cancer cells are contacted with dominant- 
negative beta 4 integrin. 

52. The method of claim 50, wherein the cancer cell is contacted with an antisense 
molecule. 

53. The method of claim 52, wherein the antisense molecule is linked to a leader 
sequence which enables translocation across a cell membrane. 

54. The method of claim 53, wherein the leader sequence binds to a cell surface protein 
which facilitates internalization.

55. The method of claim 54, wherein the leader sequence is TAT or antennapedia, or 
fragment thereof. 

56. The method of claim 52, wherein the antisense molecule is an alpha6 RNA 
antisense molecule. 

57. The method of claim 47, wherein the small molecule peptide, peptide mimetic, or 
oligonucleotide or synthetic analog thereof is linked to a carrier. 

58. The method of claim 57, wherein the carrier is at least one of a lipidic carrier,
charged carrier, retroviral carrier, TAT or fragment thereof, antennapedia or fragment thereof, or
polyethylene glycol. 

59. The method of any one of claims 1, 12, 23, 29, 33, further comprising contacting 
the cancer cell with another agent which modulates cell signaling, a chemotherapeutic drug, or 
treated with radiation or angiogenesis inhibitor. 

60. The method of any of claims 1, 12, 23, 29, 33, wherein reducing the proliferation
of cancer cell is in vitro. 

61. The method of any of claims 1, 12, 23, 29, 33, wherein reducing the proliferation
of the cancer cell is in vivo.

62. The method of any of claims 1, 12, 23, 29, 33, wherein the cancer cell is selected 
from the group consisting of melanoma, adenoma, lymphoma, myeloma, carcinoma, 
plasmocytoma, sarcoma, glioma, thymoma, leukemia, skin cancer, retinal cancer, breast cancer,
prostate cancer, colon cancer, esophageal cancer, stomach cancer, pancreas cancer, brain tumors,
lung cancer, ovarian cancer, cervical cancer, hepatic cancer, gastrointestinal cancer, and head and 
neck cancer cells. 

63. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell is killed. 

64. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated
Ras.

65. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras and a mutated p53. 

66. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras and activates the AKT/PKB protein.

67. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated 
Ras, a mutated p53, and activates the AKT/PKB protein. 

68. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated
APC.

69. The method of claims 1, 12, 23, 29, 33, wherein the cancer cell expresses a mutated
Ras and mutated APC. 

70. A method of isolating molecules that bind with a target molecule selected from the 
group consisting of B4 integrin, alpha6 integrin, and the gamma2 subunit of laminin5 comprising 1)
contacting a library of molecules with the target molecule and 2) collecting molecules that bind the 
target molecule producing an enriched population of molecules. 

71. The method of claim 70, further comprising the step of repeating steps 1 and 2 with
the enriched population of molecules. 

72. The method of claim 70, wherein the library comprises a small molecule, peptide, 
peptide mimetic, or oligonucleotide. 

73. A method of assessing a subject's risk of developing cancer comprising determining
the amount of A6 present in a target cell obtained from the subject, wherein a determination of
increased levels of A6 correlates with an increased risk of cancer.

74. The method of claim 73, further comprising comparing the amount of A6 present to the
amount in a control cell.

75. The method of claim 73, wherein determining the amount of A6 present in the target 
cell comprises assaying the amount of A6 mRNA in the cell. 

76. The method of claim 75, wherein the assaying the amount of mRNA in the cell 
comprises hybridizing an A6 probe to a sample of the subject's mRNA.

77. The method of claim 75, wherein the assaying the amount of mRNA in the cell comprises
hybridizing an A6 primer to a sample of the subject's mRNA.

78. The method of claim 77, wherein the assaying the amount of mRNA further comprises
performing a nucleic acid amplification reaction involving the primer.

79. The method of claim 78, wherein the nucleic acid amplification reaction comprises 
reverse transcription, producing a cDNA. 

80. The method of claim 79, wherein the nucleic acid amplification reaction further 
comprises performing a polymerase chain reaction on the cDNA. 

81. The method of claim 73, wherein determining the amount of A6 present in the target 
cell comprises assaying the amount of A6 protein in the cell. 

82. A method of assessing a subject's risk of acquiring cancer comprising determining the 
amount of B4 present in a target cell obtained from the subject, wherein a determination of 
increased levels of B4 correlates with an increased risk of cancer.

83. The method of claim 82, further comprising comparing the amount of B4 present to the
amount in a control cell. 

84. A method of assessing a subject's risk of acquiring cancer comprising determining the 
amount of laminin5 present in a target cell obtained from the subject, wherein a determination of 
increased levels of laminin5 correlates with an increased risk of cancer. 

85. The method of claim 84, further comprising comparing the amount of laminin5 present to
the amount in a control cell. 

86. A method of assessing a subject's risk of acquiring cancer comprising determining the 
amount of gamma2 subunit present in a target cell obtained from the subject, wherein a
determination of increased levels of gamma2 subunit correlates with an increased risk of cancer.

87. The method of claim 86, further comprising comparing the amount of gamma2 subunit
present to the amount in a control cell.

88. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which inhibits ligand binding to an integrin receptor on 

96 



SUBSTITUTE SHEET (RULE 26) 



WO 02/30465 



PCT/US01/32127 



the cancer cell, wherein the integrin receptor comprises an integrin. 

89. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which reduces integrin-integrin interaction, integrin 
receptor clustering interaction, or integrin-non-integrin protein interaction. 

90. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which reduces the production of an integrin or laminin 
by the cancer cell. 

91. A method of reducing the proliferation of a cancer cell in a patient which comprises 
administering to the patient a composition which interferes with an integrin signaling pathway.

92. The methods of claims 88-91, wherein the reduction in cancer cell proliferation is 
selective. 

93. The method of any of claims 88-91, wherein the administering is local or systemic. 

94. The method of any of claims 88-91, wherein the patient is additionally 
administered an agent which modulates cell signaling, a chemotherapeutic drug, or treated with 
radiation or angiogenesis inhibitor. 

95. The method of claim 94, wherein the additional agent is administered serially or in 
combination. 

96. The method of claim 93, wherein the local administering is direct application to 
cancer cells. 

97. The method of claim 96, wherein the direct application to cancer cells is performed 
during surgery. 

98. The method of claim 97, wherein the direct application to cancer cells is performed 
topically to cancerous tissue. 

99. The method of claim 94, wherein the systemic administration is by subcutaneous, 
intraperitoneal, intra-arterial, intravenous, or bolus administration, or by application through a 
catheter or similar apparatus. 

100. The method of claim 94, wherein the systemic administration comprises a long- 
term release formulation. 

101. The method of claim 94, wherein the systemic administration is by oral 
administration. 

102. The method of claim 101 wherein the oral administration comprises administering 
a pill, capsule, tablet, liquid or suspension. 

103. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide
which inhibits ligand binding to integrin receptors on cancer cells wherein the coding sequence is
under the control of a promoter which functions in mammalian cells. 

104. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which prevents integrin receptor subunits from interacting with one another, prevents integrin 
receptor clustering interaction, or prevents integrin receptor subunits from interacting with other
proteins on cancer cells wherein the coding sequence is under the control of a promoter which 
functions in mammalian cells. 

105. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which prevents integrin receptor subunits from interacting with one another, prevents integrin 
receptor clustering interaction, or prevents integrin receptor subunits from interacting with other 
proteins in cancer cells wherein the coding sequence is under the control of a promoter which 
functions in mammalian cells. 

106. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which interferes with integrin subunit or laminin production wherein the coding sequence is under 
the control of a promoter which functions in mammalian cells. 

107. A method of reducing the proliferation of a cancer cell in a patient which 
comprises administering to a patient a vector comprising coding sequence for a protein or peptide 
which interferes with an integrin signaling pathway of cells wherein the coding sequence is under 
the control of a promoter which functions in mammalian cells. 

108. The methods of claims 103-107, wherein the reduction in cancer cell proliferation is 
selective. 

109. The method of claims 103-107 wherein the vector is administered directly to a 
cancer cell. 

110. The method of claims 103-107, wherein the vector is administered directly to a 
normal cell. 

111. The method of claims 103-107, wherein the vector is packaged in a viral vector or 
liposome. 

112. The method of claim 88 wherein the vector is a retroviral vector.

113. The method of any one of claims 103-107, wherein the vector is administered
systemically.

114. The method of claim 109, wherein the direct administration is by topical
application. 

115. The method of claim 111, wherein the direct administration is by topical
application.

116. The method of claim 114, wherein the direct administration is performed during
surgery.

117. The method of claim 115, wherein the direct administration is performed during
surgery.

118. The method of any of claims 103-107, wherein the patient is an animal.

119. The method of claim 118 wherein the animal is a mammal.

120. The method of claim 119, wherein the mammal is human.

121. The method of claims 103-107, wherein the cancer cells are selected from the 
group consisting of melanoma, adenoma, lymphoma, myeloma, carcinoma, plasmocytoma, 
sarcoma, glioma, thymoma, leukemia, skin cancer, retinal cancer, breast cancer, prostate cancer, 
colon cancer, esophageal cancer, stomach cancer, pancreas cancer, brain tumors, lung cancer, 
ovarian cancer, cervical cancer, hepatic cancer, gastrointestinal cancer, and head and neck cancer 
cells. 

122. The method of any of claims 103-107 wherein the patient is additionally 
administered at least one of another agent which modulates cell signaling, a chemotherapeutic drug, 
an angiogenesis inhibitor or treated with radiation. 

123. The method of claim 120 wherein the other agent which modifies cell signaling, 
chemotherapeutic drug, angiogenesis inhibitor or radiation treatment is administered serially or in 
combination. 

SEQUENCE LISTING 

<110> University of Rochester Medical Center 

Deleu, Laurent 
Land, Hartmut 

<120> COMPOSITIONS THAT INHIBIT PROLIFERATION 
OF CANCER CELLS 

<130> 21108. 0G05P1 

<150> 60/239,705 
<151> 2000-10-12 

<150> 60/242,812 
<151> 2000-10-24 

<160> 19 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3222 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 1 

atggccgccg ccgggcagct gtgcttgctc tacctgtcgg cggggctcct gtcccggctc 60 

ggcgcagcct tcaacttgga cactcgggag gacaacgtga tccggaaata tggagacccc 120 

gggagcctct tcggcttctc gctggccatg cactggcaac tgcagcccga ggacaagcgg 180 

ctgttgctcg tgggggcccc gcgcggagaa gcgcttccac tgcagagagc caacagaacg 240 

ggagggctgt acagctgcga catcaccgcc cgggggccat gcacgcggat cgagtttgat 300 

aacgatgctg accccacgtc agaaagcaag gaagatcagt ggatgggggt caccgtccag 360 

agccaaggtc cagggggcaa ggtcgtgaca tgtgctcacc gatatgaaaa aaggcagcat 420 

gttaatacga agcaggaatc ccgagacatc tttgggcggt gttatgtcct gagtcagaat 480 

ctcaggattg aagacgatat ggatggggga gattggagct tttgtgatgg gcgattgaga 540 

ggccatgaga aatttggctc ttgccagcaa ggtgtagcag ctacttrtac taaagacttt 600 

cattacattg tatttggagc cccgggtact tataactgga aagggattgt tcgtgtagag 660 

caaaagaata acactttttt tgacatgaac atctttgaag atgggcctta tgaagttggt 720 

ggagagactg agcatgatga aagtctcgtt cctgttcctg ctaacagtta cttaggtttt 780 

tctttggact cagggaaagg tattgtttct aaagatgaga tcacttttgt atctggtgct 840 

cccagagcca atcacagigg agccgtggtt ttgctgaaga gagacatgaa gtctgcacat 900 

ctcctccctg agcacatatt cgatggagaa ggtctggcct cttcatttgg ctatgatgtg 960 

gcggtggtgg acctcaacaa ggatgggtgg caagatatag ttattggagc cccacagtat 1020 

tttgatagag atggagaagt tggaggtgca gtgtatgtct acatgaacca gcaaggcaga 1080 

tggaataatg tgaagccaat tcgtcttaat ggaaccaaag attctatgtt tggcattgca 1140 

gtaaaaaata ttggagatat taatcaagat ggctacccag atattgcagt tggagctccg 1200 

tatgatgact tgggaaaggt ttttatctat catggatctg caaatggaat aaataccaaa 1260 

ccaacacagg ttctcaaggg tatatcacct tattttggat attcaattgc tggaaacatg 1320 

gaccttgatc gaaattccta ccctgatgtt gctgttggtt ccctctcaga ttcagtaact 1380 

attttcagat cccggcctgt gattaatatt cagaaaacca tcacagtaac tcctaacaga 1440 

attgacctcc gccagaaaac agcctgtggg gcgcctagtg ggatatgcct ccaggttaaa 1500 

tcctgttttg aatatactgc taaccccgct ggttataatc cttcaatatc aattgtgggc 1560 

acacttgaag ctgaaaaaga aagaagaaaa tctgggctat cctcaagagt rcagtttcga 1620 

aaccaaggtt ctgagcccaa atatactcaa gaactaactc tgaagaggca gaaacagaaa 1680 

gtgtgcatgg aggaaaccct gtggctacag gataatatca gagataaact gcgtcccatt 1740 

cccataactg cctcagtgga gatccaagag ccaagctctc gtaggcgagt gaattcactt 1800 

ccagaagttc ttccaattct gaattcagat gaacccaaga cagctcatat tgatgttcac 1860 

ttcttaaaag agggatgtgg agacgacaat gtatgtaaca gcaaccttaa actagaatat 1920 

aaattttgca cccgagaagg aaatcaagac aaattttctt atttaccaat tcaaaaaggt 1980 

gtaccagaac tagttctaaa agatcagaag gatattgctt tagaaataac agtgacaaac 2040 

agcccttcca acccaaggaa tcccacaaaa gatggcgatg acgcccatga ggctaaactg 2100 

attgcaacgt ttccagacac :ttaaccLa: tctgcatata gagaactgag ggctttccct 2160 

gagaaacagt tgagttgtgt tgccaaccag aatggctcgc aagctgactg tgagctcgga 2220 

aatcctttta aaagaaattc aaatgtcact ttttatttgg ttttaagtac aactgaagtc 2280 

acctttgaca ccccatatct ggatattaat ctgaagttag aaacaacaag caatcaagat 2340 

aatttggctc caattacagc taaagcaaaa gtggttattg aactgctttt atcggtctcg 2400 

ggagttgcta aaccttccca ggtgtatttt ggaggtacag ttgttggcga gcaagctatg 2460 

aaatctgaag atgaagtggg aagtttaata gagtatgaat tcagggtaat aaacttaggt 2520 

aaacctctta caaacctcgg cacagcaacc ttgaacattc agtggccaaa agaaattagc 2580 

aatgggaaat ggttgcttta tttggtgaaa gtagaatcca aaggattgga aaaggtaact 2640 

tgtgagccac aaaaggagat aaactccctg aacctaacgg agtctcacaa ctcaagaaag 2700 

aaacgggaaa ttactgaaaa acagatagat gataacagaa aattttcttt atttgctgaa 2760 

agaaaatacc agactcttaa ctgtagcgtg aacgtgaact gtgtgaacat cagatgcccg 2820 

ctgcgggggc tggacagcaa ggcgtctctt attttgcgct cgaggttatg gaacagcaca 2880 

tttctagagg aatattccaa actgaactac ttggacattc tcatgcgagc cttcattgat 2940 

gtgactgctg ctgccgaaaa tatcaggctg ccaaatgcag gcactcaggt tcgagtgact 3000 

gtgtttccct caaagactgt agctcagtat tcgggagtac cttggtggat catcctagtg 3060 

gctattctcg ctgggatctt gatgcttgct ttattagtgt ttatactatg gaagtgtggt 3120 

ttcttcaaga gaaataagaa agatcattat gatgccacat atcacaaggc tgagatccat 3180 

gctcagccat ctgataaaga gaggcttact tctgatgcat ag 3222 

<210> 2 
<211> 1072 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 2 

Met Ala Ala Ala Gly Gln Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
1               5                   10                  15 

Leu Ser Arg Leu Gly Ala Ala Phe Asn Leu Asp Thr Arg Glu Asp Asn 
            20                  25                  30 

Val Ile Arg Lys Tyr Gly Asp Pro Gly Ser Leu Phe Gly Phe Ser Leu 
        35                  40                  45 

Ala Met His Trp Gln Leu Gln Pro Glu Asp Lys Arg Leu Leu Val Gly 
    50                  55                  60 

Ala Pro Arg Gly Glu Ala Leu Pro Leu Gln Arg Ala Asn Arg Thr Gly 
65                  70                  75                  80 

Gly Leu Tyr Ser Cys Asp Ile Thr Ala Arg Gly Pro Cys Thr Arg Ile 
                85                  90                  95 

Glu Phe Asp Asn Asp Ala Asp Pro Thr Ser Glu Ser Lys Glu Asp Gln 
            100                 105                 110 

Trp Met Gly Val Thr Val Gln Ser Gln Gly Pro Gly Gly Lys Val Val 
        115                 120                 125 

Thr Cys Ala His Arg Tyr Glu Lys Arg Gln His Val Asn Thr Lys Gln 
    130                 135                 140 

Glu Ser Arg Asp Ile Phe Gly Arg Cys Tyr Val Leu Ser Gln Asn Leu 
145                 150                 155                 160 

Arg Ile Glu Asp Asp Met Asp Gly Gly Asp Trp Ser Phe Cys Asp Gly 
                165                 170                 175 

Arg Leu Arg Gly His Glu Lys Phe Gly Ser Cys Gln Gln Gly Val Ala 
            180                 185                 190 

Ala Thr Phe Thr Lys Asp Phe His Tyr Ile Val Phe Gly Ala Pro Gly 
        195                 200                 205 

Thr Tyr Asn Trp Lys Gly Ile Val Arg Val Glu Gln Lys Asn Asn Thr 
    210                 215                 220 

Phe Phe Asp Met Asn Ile Phe Glu Asp Gly Pro Tyr Glu Val Gly Gly 
225                 230                 235                 240 

Glu Thr Glu His Asp Glu Ser Leu Val Pro Val Pro Ala Asn Ser Tyr 
                245                 250                 255 

Leu Gly Phe Ser Leu Asp Ser Gly Lys Gly Ile Val Ser Lys Asp Glu 
            260                 265                 270 

Ile Thr Phe Val Ser Gly Ala Pro Arg Ala Asn His Ser Gly Ala Val 
        275                 280                 285 

Val Leu Leu Lys Arg Asp Met Lys Ser Ala His Leu Leu Pro Glu His 
    290                 295                 300 

Ile Phe Asp Gly Glu Gly Leu Ala Ser Ser Phe Gly Tyr Asp Val Ala 
305                 310                 315                 320 

Val Val Asp Leu Asn Lys Asp Gly Trp Gln Asp Ile Val Ile Gly Ala 
                325                 330                 335 

Pro Gln Tyr Phe Asp Arg Asp Gly Glu Val Gly Gly Ala Val Tyr Val 
            340                 345                 350 

Tyr Met Asn Gln Gln Gly Arg Trp Asn Asn Val Lys Pro Ile Arg Leu 
        355                 360                 365 

Asn Gly Thr Lys Asp Ser Met Phe Gly Ile Ala Val Lys Asn Ile Gly 
    370                 375                 380 

Asp Ile Asn Gln Asp Gly Tyr Pro Asp Ile Ala Val Gly Ala Pro Tyr 
385                 390                 395                 400 

Asp Asp Leu Gly Lys Val Phe Ile Tyr His Gly Ser Ala Asn Gly Ile 
                405                 410                 415 

Asn Thr Lys Pro Thr Gln Val Leu Lys Gly Ile Ser Pro Tyr Phe Gly 
            420                 425                 430 

Tyr Ser Ile Ala Gly Asn Met Asp Leu Asp Arg Asn Ser Tyr Pro Asp 
        435                 440                 445 

Val Ala Val Gly Ser Leu Ser Asp Ser Val Thr Ile Phe Arg Ser Arg 
    450                 455                 460 

Pro Val Ile Asn Ile Gln Lys Thr Ile Thr Val Thr Pro Asn Arg Ile 
465                 470                 475                 480 

Asp Leu Arg Gln Lys Thr Ala Cys Gly Ala Pro Ser Gly Ile Cys Leu 
                485                 490                 495 

Gln Val Lys Ser Cys Phe Glu Tyr Thr Ala Asn Pro Ala Gly Tyr Asn 
            500                 505                 510 

Pro Ser Ile Ser Ile Val Gly Thr Leu Glu Ala Glu Lys Glu Arg Arg 
        515                 520                 525 

Lys Ser Gly Leu Ser Ser Arg Val Gln Phe Arg Asn Gln Gly Ser Glu 
    530                 535                 540 

Pro Lys Tyr Thr Gln Glu Leu Thr Leu Lys Arg Gln Lys Gln Lys Val 
545                 550                 555                 560 

Cys Met Glu Glu Thr Leu Trp Leu Gln Asp Asn Ile Arg Asp Lys Leu 
                565                 570                 575 

Arg Pro Ile Pro Ile Thr Ala Ser Val Glu Ile Gln Glu Pro Ser Ser 
            580                 585                 590 

Arg Arg Arg Val Asn Ser Leu Pro Glu Val Leu Pro Ile Leu Asn Ser 
        595                 600                 605 

Asp Glu Pro Lys Thr Ala His Ile Asp Val His Phe Leu Lys Glu Gly 
    610                 615                 620 

Cys Gly Asp Asp Asn Val Cys Asn Ser Asn Leu Lys Leu Glu Tyr Lys 
625                 630                 635                 640 

Phe Cys Thr Arg Glu Gly Asn Gln Asp Lys Phe Ser Tyr Leu Pro Ile 
                645                 650                 655 

Gln Lys Gly Val Pro Glu Leu Val Leu Lys Asp Gln Lys Asp Ile Ala 
            660                 665                 670 

Leu Glu Ile Thr Val Thr Asn Ser Pro Ser Asn Pro Arg Asn Pro Thr 
        675                 680                 685 

Lys Asp Gly Asp Asp Ala His Glu Ala Lys Leu Ile Ala Thr Phe Pro 
    690                 695                 700 

Asp Thr Leu Thr Tyr Ser Ala Tyr Arg Glu Leu Arg Ala Phe Pro Glu 
705                 710                 715                 720 

Lys Gln Leu Ser Cys Val Ala Asn Gln Asn Gly Ser Gln Ala Asp Cys 
                725                 730                 735 

Glu Leu Gly Asn Pro Phe Lys Arg Asn Ser Asn Val Thr Phe Tyr Leu 
            740                 745                 750 

Val Leu Ser Thr Thr Glu Val Thr Phe Asp Thr Pro Tyr Leu Asp Ile 
        755                 760                 765 

Asn Leu Lys Leu Glu Thr Thr Ser Asn Gln Asp Asn Leu Ala Pro Ile 
    770                 775                 780 

Thr Ala Lys Ala Lys Val Val Ile Glu Leu Leu Leu Ser Val Ser Gly 
785                 790                 795                 800 

Val Ala Lys Pro Ser Gln Val Tyr Phe Gly Gly Thr Val Val Gly Glu 
                805                 810                 815 

Gln Ala Met Lys Ser Glu Asp Glu Val Gly Ser Leu Ile Glu Tyr Glu 
            820                 825                 830 

Phe Arg Val Ile Asn Leu Gly Lys Pro Leu Thr Asn Leu Gly Thr Ala 
        835                 840                 845 

Thr Leu Asn Ile Gln Trp Pro Lys Glu Ile Ser Asn Gly Lys Trp Leu 
    850                 855                 860 

Leu Tyr Leu Val Lys Val Glu Ser Lys Gly Leu Glu Lys Val Thr Cys 
865                 870                 875                 880 

Glu Pro Gln Lys Glu Ile Asn Ser Leu Asn Leu Thr Glu Ser His Asn 
                885                 890                 895 

Ser Arg Lys Lys Arg Glu Ile Thr Glu Lys Gln Ile Asp Asp Asn Arg 
            900                 905                 910 

Lys Phe Ser Leu Phe Ala Glu Arg Lys Tyr Gln Thr Leu Asn Cys Ser 
        915                 920                 925 

Val Asn Val Asn Cys Val Asn Ile Arg Cys Pro Leu Arg Gly Leu Asp 
    930                 935                 940 

Ser Lys Ala Ser Leu Ile Leu Arg Ser Arg Leu Trp Asn Ser Thr Phe 
945                 950                 955                 960 

Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp Ile Leu Met Arg Ala 
                965                 970                 975 

Phe Ile Asp Val Thr Ala Ala Ala Glu Asn Ile Arg Leu Pro Asn Ala 
            980                 985                 990 

Gly Thr Gln Val Arg Val Thr Val Phe Pro Ser Lys Thr Val Ala Gln 
        995                 1000                1005 

Tyr Ser Gly Val Pro Trp Trp Ile Ile Leu Val Ala Ile Leu Ala Gly 
    1010                1015                1020 

Ile Leu Met Leu Ala Leu Leu Val Phe Ile Leu Trp Lys Cys Gly Phe 
1025                1030                1035                1040 

Phe Lys Arg Asn Lys Lys Asp His Tyr Asp Ala Thr Tyr His Lys Ala 
                1045                1050                1055 

Glu Ile His Ala Gln Pro Ser Asp Lys Glu Arg Leu Thr Ser Asp Ala 
            1060                1065                1070 



<210> 3 
<211> 3222 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 3 

atggccgtcg cgggccagtt gtgcctgctc tacctgfcccg cggggcttct agcccggctg 60 

ggtacagcct tcaacctgga cacccgcgag gacaacgtga tccggaaatc gggggafcccc 120 

gggagcctct tcggcttctc gctcgccatg cactggcagt tgcagccgga ggacaagcgg 180 

ctgttgcttg tgggggcacc tcgggcagaa gcactcccgc tgcagagggc gaacagaaca 240 

gggggcctgt acagctgtga catcacctcc cgaggacctt gtacacggat tgaatttgat 300 

aabgacgctg atcctatgtc agaaagcaag gaagaccagt ggabgggagt cactgtccag 360 

agccaaggbc cagggggcaa agbggbgacg bgbgcacabc gatatgagaa acggcagcac 420 

gtcaacacga agcaggagtc gcgggafcabc tttggaagat gbbabgbcct gagtcagaab 480 

cbcagaattg aagatgatat ggacggagga gactggagtt tctgcgatgg ccggttgaga 540 

ggccatgaaa agtttggctc ctgtcagcaa ggagtagcgg cbactttcac taaggacttt 600 

cattacattg tttttggagc cccagggact tacaactgga aagggatcgt ccgtgtagaa 660 

caaaagaaba acactbbttb tgacatgaac atctttgaag atgggcccta bgaagtbggb 720 

ggagagacag atcatgatga aagtcbcgbg cccgbbccbg ctaacagbba cctaggcttt 780 

bcgctggacb cagggaaggg babtgbbbct aaagabgaca tcacttttgt gbctggtgct 840 

ccaagagcca atcacagtgg ggctgtagtg ttgctaaaaa gagacabgaa gtccgcacat 900 

ctgctccctg agtatatatt tgacggagaa ggcctggctt cctcgtttgg ctatgatgbg 960 

gcagtggtgg acctcaatgc agatgggtgg caagacatcg ttatcggagc tccacagtat 1020 

ttfcgataggg abggbgaagb cgggggtgca gbbbacgtct acattaacca gcaaggcaaa 1080 

tggagtaatg tgaagccgat tcgtctaaat gggaccaaag actcgatgtt tggaatctct 1140 

gtgaaaaata taggbgatat baaccaagab ggcbabccag ababtgcbgb tggagctccc 1200 

tatgabgatc bggggaaggt bbbtatcbab catggabccc cgactggcab aabbaccaag 1260 

ccaacacagg ttcbcgaggg gacabcgccb bacttcggcb abbcaatcgc igggaatatg 1320 

gacctggatc ggaattccta ccccgaccbt gctgbgggcb ccctctcaga cbcggtcact 1380 

atbbbcagab cccggccagb gabbaacabb cbaaaaacca tcacagtgac- tcctaacaga 1440 

atbgaccbcc gccagaagbc catqtgtggc bcaccbagcg ggatabgccb caaggtbaaa 1500 

gcctgbtbtg aatatacbgc gaaaocbtcc ggttataacc cbccaatatc aabbttgggt 1560 

atbcbbgaag ctgaaaaaga aagaagaaaa tcagggttgt catcgagagt tcagttbcga 1620 

aaccaaggbb ccgagccaaa gbabacbcag gagcbgaccc tgaabcggca gaagcagcgg 1680 

gcgbgcabgg aggagaccct cbggctgcag gagaacabca gagacaagcb gcgbcccabc 1740 

cccabcacgg cttctgtgga gabccaggag cccacgtcbc gccggcgggb gaacbcacbc 1800 

cccgaagttc bbcccatcct gaabtcaaat gaagccaaaa cggtccagac agabgtccac 1860 

btcbbaaagg aaggabgbgg agacgacaat gbctgbaaca gcaaccbbaa gcbagagbab 1920 

aaatttggta cccgagaagg aaatcaagac aaattcbcbt accttccaab tcaaaaaggc 1980 

atcccagaab tagbcctaaa agatcagaaa gabatagcbc bggaaataac ggtgaccaac 2040 

agccctbcgg abccaaggaa bccccggaaa gabggcgacg abgcccabga agccaaacbc 2100 

atcgccacgb bbccagacac bctgacatat bccgcbbaca gagaactgag ggcbbtcccb 2160 

gagaagcagc bgagcbgbgb ggccaaccag aabggcbccc aagccgacbg bgagcbcgga 2220 

aatcctttca agagaaabbc cagtgttacb btctatctga tbbbaagtac aaccgaggbc 2280 

acctbtgaca ccacagatct ggatattaat ctgaagttgg aaacaacaag caatcaggat 2340 

aaabbggcbc caabbacagc gaaggcaaaa gtggbtatbg aattgctttb atccctctcc 2400 

ggagbcgcba agccbbcgca ggbgbabbbb ggaggbacag bbgbbggbga gcaagcbabg 2460 

aaatcbgaag abgaagtagg aagbttaata gagtabgaab ttagggtgab taacttaggc 2520 

aagcctcbta aaaaccbcgg cacagcaacc bbgaababac agbggcccaa ggagabbagc 2580 

aabggcaaab ggbtgctbta bbbgatgaaa gbbgaabcca aaggtbtgga gcagatbgbt 2640 

tgbgagccac acaatgaaab aaactaccbg aagctgaagg agtctcacaa ctcaagaaag 2700 

aaacgggaac bbccbgaaaa acagabagat gacagcagga aattttcttt abbbccbgaa 2760 

agaaaabacc agacbcbcaa cbgcagcgbc aacgbcaggb gbgbgaacab caggtgccca 2820 

ctgcgagggc bggacacgaa ggccbcbcbc gbtcbgbgbb ccaggttgbg gaacagcaca 2880 

ttbcbagagg aababbccaa acbgaacbac bbggacabbc bcgbgagggc bbccabagab 2940 

gtcaccgctg cbgcbcagaa batcaagctc cctcacgcgg gcactcaggt tcgagtgacg 3000 

gbgbbbcccb caaagactgt agct cagbat bcaggagbag ctbggbggab cabcctccbg 3060 

gcbgttctbg ccgggabbcb gabgcbggct cbabtagtgt bbbbactgbg gaagbgbggc 3120 

bbcbbcaaga gaaabaagaa agatcabtac gabgccaccb atcacaaggc bgagabccab 3180 

actcagccgt cbgabaaaga gaggcbbacb bccgabgcab ag 3222 



<210> 4 
<211> 1073 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 4 

Met Ala Val Ala Gly Gln Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
1               5                   10                  15 

Leu 


Aia 


Arg 


Leu 








2 0 


val 


T^i 


Arg 


T 

Lys 






35 






Met 


His 


Trp 




50 






Gly 


Ala 


Pro 


Arg 


65 








*ny 


Gly 


Leu 


f 1 1 _ _ _ _ 

Tyr 


lie 


Glu 


Phe 


Asp 








100 


Gin 


Trp 


Met 


Gly 






115 




Val 


Thr 


Cys 


Ala 




130' 






Gin 


Glu 


Ser 


Arg 


145 








Leu 


Arg 


lie 


Glu 


Gly 


Arg 


Leu 


Arg 








180 


Ala 


Ala 


Thr 


Phe 






195 




Gly 


Thr 


Tyr 


Asn 




210 






Thr 


Phe 


Phe 


Asp 


225 








Gly 


Glu 


t-n 1 

Thr 


Asp 


Ty^ 


Leu 


Gly 


Phe 








2 60 


Asp 


lie 


Thr 


Phe 






275 




Val 


XT T 

V al 


Leu 


Leu 




290 






Tyr 


lie 


Phe 


Asp 


305 








Ala 


Val 


Val 


Asp 


TV "1 — 

Ala 


Pro 


Gin 


Tyr 








34 0 


-i- -r 1 

Val 


Tyr 


lie 


Asn 






355 




Leu 


Asn 


Gly 


Thr 




■"S »~I <r—\ 

370 




Gly 


Asp 


lie 


Asn 


385 








Tyr 


Asp 


Asp 


Leu 


lie 


lie 


Thr 


Lys 








420 


Gly 


Tyr 


Ser 


lie 






435 




Asp 


Leu 


Ala 


val 




4 50 






Arg 


Pro 


Val 


lie 


465 








lie 


Asp 


Leu 


Arg 


Leu 


Lys 


Val 


Lys 








500 



Gly 


Thr 


Ala 


Phe 


Ser 


Gly 


Asp 


Pro 








40 


Gin 


Leu 


Gin 


Pro 






55 




Ala 


Glu 


Ala 


Leu 




7 0 






Ser 


Cys 


Asp 


lie 


85 








Asn 


Asp 


Ala 


Asp 


Val 


Thr 


Val 


Gin 








12 0 


His 


Arg 


Tyr 


Glu 






135 




Asp 


lie 


■i-t^ l"! 


Gly 




15 0 






Asp 


Asp 


Met 


Asp 


165 








Gly 


His 


Glu 


Lys 


Thr 


Lys 


Asp 


Phe 








200 


Trp 


Lys 


Gly 


lie 






215 




Met 


Asn 


lie 


Phe 




23 0 






His 


Asp 


Glu 


Ser 


245 








Ser 


Leu 


Asp 


Ser 


Val 


Ser 


Gly 


Ala 








280 


Lys 


Arg 


Asp 


Met 






295 




Gly 


Glu 


Gly 


Leu 




310 






Leu 


Asn 


Ala 


Asn 


325 








Phe 


Asp 


Arg 


Asp 


Gin 


Gin 


Gly 


Lys 








3 60 


Lys 


Asp 


Ser 


Met 






375 




Gin 


Asp 


Gly 


Tyr 




390 






Gly 


Lys 


Val 


Phe 










Pro 


Thr 


Gin 


Val 


Ala 


Gly 


Asn 


Met 








4 4 0 


Gly 


Ser 


Leu 


Ser 






455 




Asn 


He 


Leu 


Lys 




470 






Gin 


Lys 


Ser 


Met 


485 








Ala 


Cys 


Phe 


Glu 






Asn 


Leu 


Asp 


Thr 


25 






Gly 


d e r 


Leu 


Phe 


Glu 


Asp 


Lys 


Arg 








60 


Pro 


Leu 


Gin 


Arg 






75 




Thr 


Ser 


Arg 


Gly 




90 






Pro 


Met 


Ser 


Glu 


105 








Ser 


Gin 


Gly 


Pro 


Lys 


Arg 


Gin 


His 








140 


Arg 


Cys 


Tyr 


Val 






155 




Gly 


Gly 


Asp 


Trp 




17 0 






Phe 


Glv 


Ser 


Cvs 


185 








His 


Tyr 


He 


Val 


Val 


Arg 


Val 


Glu 








2 20 


Glu 


Asp 


Gly 


Pro 






235 




Leu 


Val 


Pro 


Val 




250 






Gly 


Lys 


Gly 


He 


2 65 








Pro 


Arg 


Ala 


Asn 


Lys 


Ser 


Ala 


His 








300 


Ala 


Ser 


Ser 


Phe 






315 




Gly 


Trp 


Gin 


Asp 




3 30 






Gly 


Glu 


Val 


Glv 


345 








Trp 


Ser 


Asn 


Val 


Phe 


Gly 


He 


Ser 








380 


Pro 


Asp 


He 


Ala 






395 




He 


Tyr 


His 


Gly 




410 






Leu 


Glu 


Gly 


Thr 


425 








Asp 


Leu 


Asp 


Arg 


Asp 


Ser 


Val 


Thr 








460 


Thr 


He 


Thr 


Val 






47 5 




Cys 


Gly 


Ser 


Pro 




490 






Tyr 


Thr 


Ala 


Lys 


505 









Arg 


Glu 


Asp 


7\sn 




30 






Gly 


Phe 


Ser 


Leu 


45 








Leu 


Leu 


Leu 


Val 


Ala 


Asn 


Arg 


Thr 








80 


Pro 


Cys 


Thr 


Arg 






95 




Ser 


Lys 


Glu 


Asp 




110 






Glv 


Glv 


Ti v s 


Val 


125 








Val 


Asn 


Thr 


Lys 


Leu 


Ser 


Gin 


Asn 








160 


Ser 


Phe 


Cys 


Asp 






175 




Gin 


Gin 


Gly 


Val 




190 






Phe 

rU X X W 


Glv 


Al a 


Pro 


2 05 








Gin 


Lys 


Asn 


Asn 


Tyr 


Glu 


Val 


Gly 








240 


Pro 


Ala 


Asn 


Ser 






255 




Val 


Ser 


Lys 


Asp 




270 






Kis 


Ser 


Glv 


Ala 


285 








Leu 


Leu 


Pro 


Glu 


Gly 


Tyr 


Asp 


Val 








320 


He 


Val 


lie 


Gly 






335 




Gly 


Ala 


Val 


Tyr 




350 






Lvs 


Pro 


He 


Ar a 


3 65 








Val 


Lys 


Asn 


3u. 


Val 


Gly 


Ala 


Pro 








4 00 


Ser 


Pro 


Thr 


Gly 






415 




Ser 


Pro 


Tyr 


Phe 




430 






Asn 


Ser 


Tyr 


Pro 


445 






He 


Phe 


Arg 


Ser 


Thr 




Asn 


Arg 








4 80 


Ser 


Gly 


lie 


Cys 






495 




Pro 


Ser 


Gly 


Tyr 




510 
Asn 


Pro 


Pro 


lie 






515 




Arg 


Lys 


Ser 


Gly 




530 






Glu 


Pro 


Lvs 


Tvr 


54 5 








Ala 


Cys 


Met 


Glu 


Leu 


Arg 


Pro 


jl 








580 


Ser 


Arg 


Arg 


Arg 






595 




Ser 


Asn 


Glu 


Ala 




610 






Glv 


Cvs 


Glv 


Aso 


625 








Lys 


Phe 


Gly 


Thr 


lie 


Gin 


Lys 


Gly 








660 


Ala 


Leu 


Glu 


lie 






675 




Arg 


Lys 


Asp 


Gly 




690 






Pro 


Aso 


Thr 


Leu 


7 05 








Glu 


Lys 


Gin 


Leu 


Cys 


Glu 


Leu 


Gly 








740 


Leu 


lie 


Leu 


Ser 






755 




>i< ^5 


Asn 


Leu 


Lys 




770 






lie 


Thr 


Ala 


Lvs 


785 








Gly 


Val 


Ala 


Lys 


Glu 


Gin 


Ala 


Met 








820 


Glu 


Phe 


Arg 


Val 






835 




Ala 


Thr 


Leu 


Asn 




850 






U c; u. 


J_i \Z 


•tyx 


T nO 1 1 


865 








Cys 


Glu 


Pro 


His 


Asn 


Ser 


Arg 


Lys 








900 


Arg 


Lys 


Phe 


Ser 






915 




Ser 


Val 


Asn 


Val 




930 






Asp 


Thr 


Lys 


Ala 


94 5 








Phe 


Leu 


Glu 


Glu 


Ala 


Ser 


lie 


Asp 








980 


Ala 


Gly 


Thr 


Gin 






395 





Ser lie Leu Gly 

520 



Leu 


Ser 


Ser 


Arg 






535 




Thy 

J-. 1 1 j~ 






JU tr. i_i 




550 






Glu 


Thr 


Leu 


Trp 


565 






Pro 


He 


Thr 


Ala 


val 


Asn 


Ser 


Leu 








600 


Lys 


Thr 


Val 


Gin 






615 






A ^ n 


V d Jl. 






630 






Arg 


Glu 


Gly 


Asn 


645 








lie 


Pro 


Glu 


Leu 


Thr 


Val 


Thr 


Asn 








68 0 


Asp 


Asp 


Ala 


His 






695 




Thr 


Tvr 


Ser 

*w 


Ala 




710 






Ser 


Cys 


Val 


Ala 


725 








Asn 


Pro 


Phe 


Lvs 

wA 


Thr 


Thr 


Glu 


Val 








760 


Leu 


Glu 


Thr 


Thr 






775 




Ala 


Lvs 


Val 


Val 




790 






Pro 


Ser 


Gin 


Val 


805 








Lys 


Ser 


Glu 


Asp 


lie 


Asn 


Leu 


Gly 








840 


lie 


Gin 


Trp 


Pro 






855 








vaJ. 






870 






As n 


Glu 


He 


Asn 


885 








Lys 


Arg 


Glu 


Leu 


Leu 


Phe 


Pro 


Glu 








920 


Arc 


Cys 


Val 


Asn 






935 




Ser 


Leu 


Val 


Leu 




950 






Tyr 


Ser 


Lys 


Leu 


965 








Val 


Thr 


Ala 


Ala 


Val 


Arg 


Val 


Thr 








100 






He 


Leu 


Glu 


Ala 


Val 


Gin 


Phe 


Arg 








54 0 


Thr 


Leu 


Asn 


Arg 






555 




Leu 


Gin 


Glu 


Asn 




57 0 






Ser 

Vr* 1-L— 


Val 


Glu 

\~J ^i-v 't--* 


He 

J. JL. V 


585 








Pro 


Glu 


Val 


Leu 


Thr 


Asp 


Val 


His 








620 


Asn 


Ser 


Asn 


Leu 






635 




Gin 


Asp 


Lys 


Phe 




650 






Val 


>U Ui. 


Lvs 




665 








Ser 


Pro 


Ser 


Asp 


Glu 


Ala 


Lys 


Leu 








700 


Tyr 


Arg 


Glu 


Leu 






715 




Asn 


Gin 


Asn 


Gly 




730 






Ara 


Asn 


Ser 


Ser 


745 








Thr 


Phe 


Asp 


Thr 


Ser 


Asn 


Gl n 


Asp 








780 


He 


Glu 


Leu 


Leu 






795 




Tyr 


Phe 


Gly 


Gly 




810 






Glu 


Val 


Glv 


i-J X- 


825 








Lys 


Pro 


Leu 


Lys 


Lys 


Glu 


He 


Ser 








860 


Ser 


Lys 


Gly 


Leu 






875 




Tyr 


Leu 


Lys 


Leu 




890 






Pro 


Glu 


Lys 


Gin 


905 








Arg 


Lys 


Tyr 


Gin 


He 


Arg 


Cys 


Pro 








94 0 


Cys 


Ser 


Arg 


Leu 






955 




Asn 


Tyr 


Leu 


Asp 




970 






Ala 


Gin 


Asn 


He 


985 








Val 


Phe 


Pro 


Ser 



Glu 

u J. \J. 


T n \j s 


J_ H 




525 








Asn 


Gin 


Gly 


Ser 


Gin 


Lys 


Gin 


Arg 








560 


He 


Arg 


Asp 


Lys 






575 




Gin 


Glu 


Pro 


Thr 




5 90 






"P y~ rs 


Tic 

JL _Lt? 


JLfC: Li 


Oil 


605 








Phe 


Leu 


Lys 


Glu 


Lys 


Leu 


Glu 


Tyr 








640 


Ser 


Tyr 


Leu 


Pro 






65 5 




Gin 


Lys 


Asp 


He 




670 






— JU -U 


ni. M 


A <=; n 




685 








lie 


Ala 


Thr 


Phe 


Arg 


Ala 


Phe 


Pro 








720 


Ser 


Gin 


A.1 a 


Asp 






735 




Val 


Thr 


Phe 


Tyr 




750 






Thr 


Aso 


Leu 

K** w*. 


Aso 


7 65 








Lys 


Leu 


Ala 


Pro 


Leu 


Ser 


Leu 


Ser 








8 00 


Thr 


Val 


Val 


Gly 






815 




Leu 


He 


Glu 


Tyr 




830 






Asn 


Leu 

-UJ W K*< 


Glv 


Thr 


845 






Asn 


Gly 


Lys 


Trp 


Glu 


Gin 


lie 


Val 








880 


Lys 


Glu 


Ser 


Hrs 






895 




He 


Asp 


Asp 


Ser 




910 






Thr 


Leu 


Asn 


Cys 


925 






Leu 


Arg 


Gly 


Leu 


Trp 


Asn 


Ser 


Thr 








960 


lie 


Leu 


Val 


Arg 






975 




Lys 


Leu 


Pro 


His 




9 90 






Lys 


Thr 


Val 


?\ 1 a 



1005 
Gln 


Tyr 


Ser 


Gly 


Val 


Ala 


Trp Trp 




1010 








1015 


Gly 


He 


Leu 


Met 


Leu 


Ala 


Leu Leu 


1025 










1030 


Phe 


Phe 


Lys 


Arg 


Asn 


Lys 


Lys Asp 










104 5 




Ala 


Glu 


He 


His 


Thr 


Gin 


Pro Ser 








1060 






Ala 


















He 


He 


Leu 


Leu 


Ala 


Val 


Leu 


Ala 








1020 








Val 


Phe 


Leu 


Leu 


Trp 


Lys 


Cvs 


Gly 






1035 








104 


His 


Tyr 


Asp 


Ala 


Thr 


Tyr 


His 


Lys 




1050 








1055 


Asp 


Lys 


Glu 


Arg 


Leu 


Thr 


Ser 


Asp 


1065 










1070 



<210> 5 
<211> 5622 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 5 

atggcagggc cacgccccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccag tgaagagctg cacggagtgt 120 

gtccgtgigg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt fcgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 480 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 

tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcatg 840 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcaucccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacatccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 1440 

agcgagggcfc ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcgqg 1560 

cactgtgtgfc gcfeacggcga aggccgctac gagggtcagt tctgcgagta tgacaacfctc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctcc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220

tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280

tacatgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 2460

ttgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520

cagctgcgcc aggaggtgga ggagaacctg aacgaggtct acaggcagat ctccggtgta 2580

cacaagctcc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gctgaagctt 2700

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760 

accctcactg cagaccagga cgcccggggc atggtggagt tccaggaggg cgtggagctg 2820 

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2880

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 3240

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcatc 3300 

atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360

ccccctcacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 3480 

tggattcagg gtgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 3540 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tcctccacgg tgacccagct gagctgggct 3720 

gagccggctg agaccaacgg tgagatcaca gcctacgagg tctgctatgg cctggtcaac 3780

gatgacaacc gacctattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

atgctgctta ttgagaacct tcgggagtcc cagccctacc gctacacggt gaaggcgcgc 3900 

aacggggccg gctgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccat catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccttat gtacagcgat gacgttctac gctctccatc gggcagccag 4080 

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagcccct gctgggggag 4140

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 

gcgggcggga agggcggcag cctgccccgc agtgcgacac ccgggccccc cggagagcac 4320 

ctggtgaatg gccggatgga ctttgccttc ccgggcagca ccaactccct gcacaggatg 4380 

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4440 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4500 

cactcgacca cactgccgag ggactactcc accctcacct ccgtctcctc ccacggcctc 4560 

cctcccatct gggaacacgg caggagcagg cttccgctgt cctgggccct ggggtcccgg 4620 

agtcgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc tataatcctg 4680 

gctgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4740 

gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctgg 4800

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4860

ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4980 

ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 5040

tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220

gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280 

aacgtgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 5340 

gagggcatca tcaccataga gtcccaggat ggaggtccct tcccgcagct gggcagccgt 5400

gccgggctct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 5460 

agcgccaccg agcccttcct agtgggtccg accctggggg cccagcacct ggaggcaggc 5520 

ggctccctca cccggcatgt gacccaggag tttgtgagcc ggacactgac caccagcgga 5580 

acccttagca cccacatgga ccaacagttc ttccaaactt ga 5622



<210> 6 
<211> 1873
<212> PRT 

<213> Artificial Sequence 



<220>
<223> Description of Artificial Sequence; Note =
synthetic construct

<400> 6

Met Ala Gly Pro Arg Pro Ser Pro Trp Ala Arg Leu Leu Leu Ala Ala
1 5 10 15

Leu Ile Ser Val Ser Leu Ser Gly Thr Leu Asn Arg Cys Lys Lys Ala
20 25 30

Pro Val Lys Ser Cys Thr Glu Cys Val Arg Val Asp Lys Asp Cys Ala
35 40 45

Tyr Cys Thr Asp Glu Met Phe Arg Asp Arg Arg Cys Asn Thr Gln Ala
50 55 60

Glu Leu Leu Ala Ala Gly Cys Gln Arg Glu Ser Ile Val Val Met Glu
65 70 75 80

Ser Ser Phe Gln Ile Thr Glu Glu Thr Gln Ile Asp Thr Thr Leu Arg
85 90 95

Arg Ser Gln Met Ser Pro Gln Gly Leu Arg Val Arg Leu Arg Pro Gly
100 105 110

Glu Glu Arg His Phe Glu Leu Glu Val Phe Glu Pro Leu Glu Ser Pro
115 120 125

Val Asp Leu Tyr Ile Leu Met Asp Phe Ser Asn Ser Met Ser Asp Asp
130 135 140

Leu Asp Asn Leu Lys Lys Met Gly Gln Asn Leu Ala Arg Val Leu Ser
145 150 155 160

Gln Leu Thr Ser Asp Tyr Thr Ile Gly Phe Gly Lys Phe Val Asp Lys
165 170 175

Val Ser Val Pro Gln Thr Asp Met Arg Pro Glu Lys Leu Lys Glu Pro
180 185 190

Trp Pro Asn Ser Asp Pro Pro Phe Ser Phe Lys Asn Val Ile Ser Leu
195 200 205

Thr Glu Asp Val Asp Glu Phe Arg Asn Lys Leu Gln Gly Glu Arg Ile
210 215 220

Ser Gly Asn Leu Asp Ala Pro Glu Gly Gly Phe Asp Ala Ile Leu Gln
225 230 235 240

Thr Ala Val Cys Thr Arg Asp Ile Gly Trp Arg Pro Asp Ser Thr His
245 250 255

Leu Leu Val Phe Ser Thr Glu Ser Ala Phe His Tyr Glu Ala Asp Gly
260 265 270

Ala Asn Val Leu Ala Gly Ile Met Ser Arg Asn Asp Glu Arg Cys His
275 280 285

Leu Asp Thr Thr Gly Thr Tyr Thr Gln Tyr Arg Thr Gln Asp Tyr Pro
290 295 300

Ser Val Pro Thr Leu Val Arg Leu Leu Ala Lys His Asn Ile Ile Pro
305 310 315 320

Ile Phe Ala Val Thr Asn Tyr Ser Tyr Ser Tyr Tyr Glu Lys Leu His
325 330 335

Thr Tyr Phe Pro Val Ser Ser Leu Gly Val Leu Gln Glu Asp Ser Ser
340 345 350

Asn Ile Val Glu Leu Leu Glu Glu Ala Phe Asn Arg Ile Arg Ser Asn
355 360 365

Leu Asp Ile Arg Ala Leu Asp Ser Pro Arg Gly Leu Arg Thr Glu Val
370 375 380

Thr Ser Lys Met Phe Gln Lys Thr Arg Thr Gly Ser Phe His Ile Arg
385 390 395 400

Arg Gly Glu Val Gly Ile Tyr Gln Val Gln Leu Arg Ala Leu Glu His
405 410 415

Val Asp Gly Thr His Val Cys Gln Leu Pro Glu Asp Gln Lys Gly Asn
420 425 430

Ile His Leu Lys Pro Ser Phe Ser Asp Gly Leu Lys Met Asp Ala Gly
435 440 445

Ile Ile Cys Asp Val Cys Thr Cys Glu Leu Gln Lys Glu Val Arg Ser
450 455 460

Ala Arg Cys Ser Phe Asn Gly Asp Phe Val Cys Gly Gln Cys Val Cys
465 470 475 480

Ser Glu Gly Trp Ser Gly Gln Thr Cys Asn Cys Ser Thr Gly Ser Leu
485 490 495

Ser Asp Ile Gln Pro Cys Leu Arg Glu Gly Glu Asp Lys Pro Cys Ser
500 505 510

Gly Arg Gly Glu Cys Gln Cys Gly His Cys Val Cys Tyr Gly Glu Gly
515 520 525

Arg Tyr Glu Gly Gln Phe Cys Glu Tyr Asp Asn Phe Gln Cys Pro Arg
530 535 540

Thr Ser Gly Phe Leu Cys Asn Asp Arg Gly Arg Cys Ser Met Gly Gln
545 550 555 560

Cys Val Cys Glu Pro Gly Trp Thr Gly Pro Ser Cys Asp Cys Pro Leu
565 570 575

Ser Asn Ala Thr Cys Ile Asp Ser Asn Gly Gly Ile Cys Asn Gly Arg
580 585 590

Gly His Cys Glu Cys Gly Arg Cys His Cys His Gln Gln Ser Leu Tyr
595 600 605

Thr Asp Thr Ile Cys Glu Ile Asn Tyr Ser Ala Ser Thr Arg Ala Ser
610 615 620

Ala Arg Thr Tyr Ala Pro Ala Cys Ser Ala Arg Arg Gly Ala Pro Ala
625 630 635 640

Arg Arg Arg Gly Ala Arg Val Arg Asn Ala Thr Ser Arg Ser Arg Trp
645 650 655

Trp Thr Ser Leu Arg Glu Ala Arg Arg Trp Trp Cys Ala Ala Pro Ser
660 665 670

Gly Thr Arg Met Thr Thr Ala Pro Thr Ala Thr Pro Trp Lys Val Thr
675 680 685

Ala Pro Leu Gly Pro Thr Ala Leu Ser Trp Cys Thr Arg Arg Arg Asp
690 695 700

Cys Pro Pro Gly Ser Phe Trp Trp Leu Ile Pro Leu Leu Leu Leu Leu
705 710 715 720

Leu Pro Leu Leu Ala Leu Leu Leu Leu Leu Cys Trp Lys Tyr Cys Ala
725 730 735

Cys Cys Lys Ala Cys Leu Ala Leu Leu Pro Cys Cys Asn Arg Gly His
740 745 750

Met Val Gly Phe Lys Glu Asp His Tyr Met Leu Arg Glu Asn Leu Met
755 760 765

Ala Ser Asp His Leu Asp Thr Pro Met Leu Arg Ser Gly Asn Leu Lys
770 775 780

Gly Arg Asp Val Val Arg Trp Lys Val Thr Asn Asn Met Gln Arg Pro
785 790 795 800

Gly Phe Ala Thr His Ala Ala Ser Ile Asn Pro Thr Glu Leu Val Pro
805 810 815

Tyr Gly Leu Ser Leu Arg Leu Ala Arg Leu Cys Thr Glu Asn Leu Leu
820 825 830

Lys Pro Asp Thr Arg Glu Cys Ala Gln Leu Arg Gln Glu Val Glu Glu
835 840 845

Asn Leu Asn Glu Val Tyr Arg Gln Ile Ser Gly Val His Lys Leu Gln
850 855 860

Gln Thr Lys Phe Arg Gln Gln Pro Asn Ala Gly Lys Lys Gln Asp His
865 870 875 880

Thr Ile Val Asp Thr Val Leu Met Ala Pro Arg Ser Ala Lys Pro Ala
885 890 895

Leu Leu Lys Leu Thr Glu Lys Gln Val Glu Gln Arg Ala Phe His Asp
900 905 910

Leu Lys Val Ala Pro Gly Tyr Tyr Thr Leu Thr Ala Asp Gln Asp Ala
915 920 925

Arg Gly Met Val Glu Phe Gln Glu Gly Val Glu Leu Val Asp Val Arg
930 935 940

Val Pro Leu Phe Ile Arg Pro Glu Asp Asp Asp Glu Lys Gln Leu Leu
945 950 955 960

Val Glu Ala Ile Asp Val Pro Ala Gly Thr Ala Thr Leu Gly Arg Arg
965 970 975

Leu Val Asn Ile Thr Ile Ile Lys Glu Gln Ala Arg Asp Val Val Ser
980 985 990

Phe Glu Gln Pro Glu Phe Ser Val Ser Arg Gly Asp Gln Val Ala Arg
995 1000 1005

Ile Pro Val Ile Arg Arg Val Leu Asp Gly Gly Lys Ser Gln Val Ser
1010 1015 1020

Tyr Arg Thr Gln Asp Gly Thr Ala Gln Gly Asn Arg Asp Tyr Ile Pro
1025 1030 1035 1040

Val Glu Gly Glu Leu Leu Phe Gln Pro Gly Glu Ala Trp Lys Glu Leu
1045 1050 1055

Gln Val Lys Leu Leu Glu Leu Gln Glu Val Asp Ser Leu Leu Arg Gly
1060 1065 1070

Arg Gln Val Arg Arg Phe His Val Gln Leu Ser Asn Pro Lys Phe Gly
1075 1080 1085

Ala His Leu Gly Gln Pro His Ser Thr Thr Ile Ile Ile Arg Asp Pro
1090 1095 1100

Asp Glu Leu Asp Arg Ser Phe Thr Ser Gln Met Leu Ser Ser Gln Pro
1105 1110 1115 1120

Pro Pro His Gly Asp Leu Gly Ala Pro Gln Asn Pro Asn Ala Lys Ala
1125 1130 1135

Ala Gly Ser Arg Lys Ile His Phe Asn Trp Leu Pro Pro Ser Gly Lys
1140 1145 1150

Pro Met Gly Tyr Arg Val Lys Tyr Trp Ile Gln Gly Asp Ser Glu Ser
1155 1160 1165

Glu Ala His Leu Leu Asp Ser Lys Val Pro Ser Val Glu Leu Thr Asn
1170 1175 1180

Leu Tyr Pro Tyr Cys Asp Tyr Glu Met Lys Val Cys Ala Tyr Gly Ala
1185 1190 1195 1200

Gln Gly Glu Gly Pro Tyr Ser Ser Leu Val Ser Cys Arg Thr His Gln
1205 1210 1215

Glu Val Pro Ser Glu Pro Gly Arg Leu Ala Phe Asn Val Val Ser Ser
1220 1225 1230

Thr Val Thr Gln Leu Ser Trp Ala Glu Pro Ala Glu Thr Asn Gly Glu
1235 1240 1245

Ile Thr Ala Tyr Glu Val Cys Tyr Gly Leu Val Asn Asp Asp Asn Arg
1250 1255 1260

Pro Ile Gly Pro Met Lys Lys Val Leu Val Asp Asn Pro Lys Asn Arg
1265 1270 1275 1280

Met Leu Leu Ile Glu Asn Leu Arg Glu Ser Gln Pro Tyr Arg Tyr Thr
1285 1290 1295

Val Lys Ala Arg Asn Gly Ala Gly Trp Gly Pro Glu Arg Glu Ala Ile
1300 1305 1310

Ile Asn Leu Ala Thr Gln Pro Lys Arg Pro Met Ser Ile Pro Ile Ile
1315 1320 1325

Pro Asp Ile Pro Ile Val Asp Ala Gln Ser Gly Glu Asp Tyr Asp Ser
1330 1335 1340

Phe Leu Met Tyr Ser Asp Asp Val Leu Arg Ser Pro Ser Gly Ser Gln
1345 1350 1355 1360

Arg Pro Ser Val Ser Asp Asp Thr Gly Cys Gly Trp Lys Phe Glu Pro
1365 1370 1375

Leu Leu Gly Glu Glu Leu Asp Leu Arg Arg Val Thr Trp Arg Leu Pro
1380 1385 1390

Pro Glu Leu Ile Pro Arg Leu Ser Ala Ser Ser Gly Arg Ser Ser Asp
1395 1400 1405

Ala Glu Ala Pro His Gly Pro Pro Asp Asp Gly Gly Ala Gly Gly Lys
1410 1415 1420

Gly Gly Ser Leu Pro Arg Ser Ala Thr Pro Gly Pro Pro Gly Glu His
1425 1430 1435 1440

Leu Val Asn Gly Arg Met Asp Phe Ala Phe Pro Gly Ser Thr Asn Ser
1445 1450 1455

Leu His Arg Met Thr Thr Thr Ser Ala Ala Ala Tyr Gly Thr His Leu
1460 1465 1470

Ser Pro His Val Pro His Arg Val Leu Ser Thr Ser Ser Thr Leu Thr
1475 1480 1485

Arg Asp Tyr Asn Ser Leu Thr Arg Ser Glu His Ser His Ser Thr Thr
1490 1495 1500

Leu Pro Arg Asp Tyr Ser Thr Leu Thr Ser Val Ser Ser His Gly Leu
1505 1510 1515 1520

Pro Pro Ile Trp Glu His Gly Arg Ser Arg Leu Pro Leu Ser Trp Ala
1525 1530 1535

Leu Gly Ser Arg Ser Arg Ala Gln Met Lys Gly Phe Pro Pro Ser Arg
1540 1545 1550

Gly Pro Arg Asp Ser Ile Ile Leu Ala Gly Arg Pro Ala Ala Pro Ser
1555 1560 1565

Trp Gly Pro Asp Ser Arg Leu Thr Ala Gly Val Pro Asp Thr Pro Thr
1570 1575 1580

Arg Leu Val Phe Ser Ala Leu Gly Pro Thr Ser Leu Arg Val Ser Trp
1585 1590 1595 1600

Gln Glu Pro Arg Cys Glu Arg Pro Leu Gln Gly Tyr Ser Val Glu Tyr
1605 1610 1615

Gln Leu Leu Asn Gly Gly Glu Leu His Arg Leu Asn Ile Pro Asn Pro
1620 1625 1630

Ala Gln Thr Ser Val Val Val Glu Asp Leu Leu Pro Asn His Ser Tyr
1635 1640 1645

Val Phe Arg Val Arg Ala Gln Ser Gln Glu Gly Trp Gly Arg Glu Arg
1650 1655 1660

Glu Gly Val Ile Thr Ile Glu Ser Gln Val His Pro Gln Ser Pro Leu
1665 1670 1675 1680

Cys Pro Leu Pro Gly Ser Ala Phe Thr Leu Ser Thr Pro Ser Ala Pro
1685 1690 1695

Gly Pro Leu Val Phe Thr Ala Leu Ser Pro Asp Ser Leu Gln Leu Ser
1700 1705 1710

Trp Glu Arg Pro Arg Arg Pro Asn Gly Asp Ile Val Gly Tyr Leu Val
1715 1720 1725

Thr Cys Glu Met Ala Gln Gly Gly Gly Pro Ala Thr Ala Phe Arg Val
1730 1735 1740

Asp Gly Asp Ser Pro Glu Ser Arg Leu Thr Val Pro Gly Leu Ser Glu
1745 1750 1755 1760

Asn Val Pro Tyr Lys Phe Lys Val Gln Ala Arg Thr Thr Glu Gly Phe
1765 1770 1775

Gly Pro Glu Arg Glu Gly Ile Ile Thr Ile Glu Ser Gln Asp Gly Gly
1780 1785 1790

Pro Phe Pro Gln Leu Gly Ser Arg Ala Gly Leu Phe Gln His Pro Leu
1795 1800 1805

Gln Ser Glu Tyr Ser Ser Ile Ser Thr Thr His Thr Ser Ala Thr Glu
1810 1815 1820

Pro Phe Leu Val Gly Pro Thr Leu Gly Ala Gln His Leu Glu Ala Gly
1825 1830 1835 1840

Gly Ser Leu Thr Arg His Val Thr Gln Glu Phe Val Ser Arg Thr Leu
1845 1850 1855

Thr Thr Ser Gly Thr Leu Ser Thr His Met Asp Gln Gln Phe Phe Gln
1860 1865 1870

Thr



<210> 7 
<211> 5907 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note =
synthetic construct



<400> 7

ggaccgtcga ggcagcggga ctgacccagc tgggctcact gtattaagaa gcggacccgc 60 

gacccggagc gcccggggac ccgatctggg agcctggacg ggtgcagcgc gcaggaatgc 120

agtccgcctg actcaccagc gcctccttcc tacctgcgcc gcccgtccat aaagcgctgc 180

tcgtcccgcc cgccgccgcc gccctgctgt cccgccgggc tcgcccgcgc gctcagctcg 240 

acccaacgca gcccaagtcc gaggtagtct cactaaggag gaggaggatg gcagggccct 300 

gttgcagccc atgggtgaag ctgctgctgc tggcacgaat gctgagtgcc agcctccctg 360

gagacctggc caaccgctgc aagaaggctc aggtgaagag ctgtaccgag tgcatccggg 420 

tggacaacag ctgtgcctac tgcacagacg agctgttcaa ggagaggcgc tgcaacaccc 480 

aggcggacgt tctggctgca ggctgcaggg gagagagcat cctggtcatg gagagcagcc 540 

ttgaaatcac agagaacacc cagatcgtca ccagcctgca ccgcagccag gtatctcccc 600 

aaggcctgca agtccggctg cggcggggtg aggagcgcac gtttgtgttc caggtctttg 660 

agcccctgga gagccccgtg gatctgtata tcctcatgga cttctccaac tccatgtctg 720 

acgatctgga caacctcaag cagatggggc agaacctggc caagatcctg cgccagctca 780

ccagcgacta caccattgga tttggaaagt ttgtggacaa agtcagcgtc ccacagacag 840

acatgaggcc cgagaaactg aaggagccct ggcccaacag tgatcccccg ttctccttca 900

agaacgttat cagcttaacg gagaatgtgg aagaattctg gaacaaactg caaggagaac 960 

gcatctcagg caacctggac gctcctgaag ggggctttga tgccatcctg cagacagctg 1020 

tgtgcacaag ggacattggc tggagggctg acagcaccca cctgctggtg ttctccaccg 1080 

agtctgcctt ccactacgag gctgatggtg ccaacgttct ggccggcatc atgaaccgca 1140

atgatgagaa atgccacctg gacgcctcgg gcgcctacac ccaatacaag acacaggact 1200 

acccatcagt gcccacgctg gttcgcctgc ttgccaagca taacatcatc cccatctttg 1260 

ctgtcaccaa ctactcttac agctactatg agaagctcca taagtatttc cccgtctcct 1320 

ctctgggcgt cctgcaggag gattcatcca acatcgtgga gctgctggag gaggccttct 1380

atcgaattcg ctccaacctg gacatccggg ctctggacag ccccagaggc ctgagaacag 1440

aggtcacctc cgatactctc cagaagacgg agactgggtc ctttcacatc aagcgggggg 1500

aagtgggcac atacaatgtg catctccggg cagtggagga catagatggg acacatgtgt 1560

gccagctggc taaagaagac caagggggca acatccacct gaaaccctcc ttctctgatg 1620

gcctccggat ggacgcgagt gtgatctgtg acgtgtgccc ctgtgagctg caaaaggaag 1680

ttcgatcagc tcgctgtcac ttcagaggag acttcatgtg tggacactgt gtgtgcaatg 1740 

agggctggag tggcaaaacc tgcaactgct ccaccggctc tctgagtgac acacagccct 1800 

gcctgcgtga gggtgaggac aaaccgtgct cgggccacgg cgagtgccag tgcggacgct 1860

gtgtgtgcta tggtgaaggc cgctacgagg gtcacttctg cgagtatgac aacttccagt 1920 

gtccccggac ctctggattc ctgtgcaatg accggggacg ctgttctatg ggagagtgtg 1980 

tgtgtgagcc tggttggaca ggccgcagct gcgactgtcc cctcagcaat gccacctgca 2040 

tcgatagcaa cgggggcatc tgcaacggcc gaggctactg tgagtgtggc cgttgtcact 2100 

gcaaccagca gtcgctctac acggacacca cctgtgagat caactactct gcgatactgg 2160 

gtctctgtga ggatctccgc tcctgcgtac agtgccaggc ctggggcacc ggggagaaga 2220

aagggcgcgc gtgtgacgat tgccccttta aagtcaagat ggtagacgag cttaagaaag 2280 

aagaggtggt ggagtactgc tccttccggg atgaggatga cgactgcact tacagctaca 2340 

acgtggaggg cgacggcagc cctgggccca acagcacagt cctggtccac aaaaagaaag 2400 

actgcctccc ggctccttcc tggtggctca tccccctgct catcttcctc ctgttgctcc 2460 

tggcgttgct tctgctgctc tgctggaaat actgtgcctg ctgcaaagcc tgcctggggc 2520 

ttcttccttg ctgcaaccga ggtcacatgg tgggctttaa ggaagatcac tatatgcttc 2580

gggagaacct gatggcctct gaccacctgg acacgcccat gctacgaagc gggaacctca 2640 

agggacgaga cacagtccgc tggaagatca ccaacaatgt gcagcgccct ggctttgcca 2700 

cccatgccgc cagcaccagc cccacggagc tcgtacccta cgggctgtcc ctgcgccttg 2760

gccgcctctg cactgagaac cttatgaagc cgggcacccg agagtgtgac cagctacgcc 2820 

aggaggtgga ggaaaatctg aatgaggtgt atagacaggt cagcggcgca cacaagctcc 2880 

agcagacgaa gttccgacag cagcccaacg ccgggaaaaa gcaagaccac accattgtgg 2940

acacagtgtt gctggcgccc cgctccgcca agcagatgct gctgaagctg acagagaagc 3000

aggtggagca ggggtccttc catgaactga aggtggcccc tggctactac actgtcacgg 3060 

cagagcagga tgcccggggc atggtggagt tccaggaggg cgtggagctg gtggatgtgc 3120

gagtgcccct cttcatccgg cctgaggatg atgatgagaa gcagctgctg gtggaggcca 3180 

ttgatgtccc tgtgagcact gccacccttg gtcgccgtct ggtaaacatc accattatca 3240

aggaacaagc tagtggggta gtgtccttcg agcagcctga atactcggtg agtcgtggag 3300 

^ccaggtggc ccgcatccct gtcatccggc acatcctgga caatggcaag tcccaggtct 3360

cctatagcac acaggataat acagcacacg gacaccggga ttatgttccc gtggagggac 3420

agctgctgtt ccatcctggg gaaacctgga aggagttgca ggtgaagcta ctggagctgc 3480

aggaggttga ctccctcctg cgtggccgcc aggtccgccg cttccaagtc caactcagca 3540

accccaagtt cggagcccgc ctgggccagc ccagcacaac caccgttatt ctcgatgaaa 3600

cggacaggag tctcataaat caaacacttt catcgcctcc gccaccccat ggagacctgg 3660 

gcgccccaca gaaccccaat gccaaggctg ccggatccag gaagatccat ttcaactggc 3720 

tgccccctcc tggcaagcca atggggtaca gggtgaagta ctggatccag ggcgactctg 3780 

aatctgaagc ccaccttcta gatagcaagg tgccctcagt ggaactcacc aacctgtatc 3840 

cctattgcga ctacgaaatg aaggtgtgtg cctatggggc caagggtgag gggccctata 3900 

gctcactggt gtcctgccgc acccaccagg aagtacccag tgagccaggg aggctggctt 3960 

tcaatgtagt ctcttctacg gtgactcagc tgagctgggc agagccagct gagaccaatg 4020

gcgagatcac agcctacgag gtctgctatg gactggtcaa tgaggacaac agacccattg 4080 

gacctatgaa gaaggtgctc gtggacaacc ccaagaaccg gatgctgctc attgagaatc 4140 

tgcgagattc ccagccatac cgatacacgg ttaaggcgcg caatggggca ggatggggac 4200 

ccgagagaga ggctatcatc aacctggcta cacagcccaa gcggcccatg tccatcccta 4260

tcatcccaga catccccata gtggacgccc agggtggaga agactacgaa aacttcctta 4320 

tgtacagtga tgacgtcctg cggtccccag ccagcagcca gaggcccagc gtttctgatg 4380 

acactgagca cctggtgaat ggccggatgg actttgccta tccaggcagc gccaactccc 4440

tgcacagaat gactgcagcc aatgtggcct atggcacgca tctgagccca cacctgtccc 4500 

accgagtgct gagcacgtcc tccaccctta cccgggacta ccactctctg acacgcacag 4560 

accactccca ctcaggcaca cttcccaggg actactccac cctcacttcc ctttcctccc 4620 

aagcctccct cctatctggg aagatgggag gagcaggctt ccgctgtcct ggactcttgg 4680 

gtccttgagc cgggctcaca tgaagggtgt gcccgcatcc aggggttcac cagactctat 4740 

aatcctggcc gggcagtcag cagcaccctc ctggggtaca ggattcccgt ggggctgtgg 4800 

gtgtgcctga cacacccact cggctggtgt tctctgccct ggggcgcacg tctttgaagg 4860

tgagctggca ggagccacag tgtgatcgga cgctgctggg ctacagtgtg gaataccagc 4920 

tactaacgtg cgtcgagatg catcggctca acatccctaa ccctggccaa acctcggtgg 4980 

tggtagagga tctcctgcct aactactctc atgtgttccg ggtacgggca cagagccagg 5040

agggctgggg tcgagagcga gagggtgtca tcaccatcga gtcccaggtg cacccgcaga 5100 

gccctctctg ccccctgcca ggctcagcct tcactctgag cacccccagc gccccaggac 5160 

cactggtgtt cactgcccta agcccagact ccccgcagct cagctgggag cggccgagga 5220 

gccgcaatgg agatatcctt ggctacctgg tgacctgtga gatggcccaa ggaggagcac 5280 

cagccaggac cttccgggtg gacggagaca accctgagag ccggttgact gtacctggcc 5340

tcagtgagaa cgttccttac aagttcaagg ttcaggccag gacgaccgag ggctttgggc 5400 

cagagcgtga gggtatcatc accatcgagt ctcaggttgg aggccccttc ccacagctgg 5460 

gcagcaattc tgggctcttc cagaacccag tgcaaagcga gttcagcagc gtgaccagca 5520 

cgcacagcac cacgactgag cccttcctca tggatggtct aaccctgggg acccagcgcc 5580 

tggaagcagg aggctccctc acccggcatg tgacccagga attcgtgacc cggaccttaa 5640 

cggccagtgg ctctctcagc actcatatgg accaacagtt cttccaaacc tgaacctccc 5700 

ccgcgcccca gccacctggg cccctccttg cctcctctcc tagcgccttc ttcctctgct 5760

gctctaccca cgagcttgct gaccacagag ccagcccctg tagtcagaga gcaggggtag 5820 

gtgctgtcca ggaaccataa agtgggtaga ggtgatacaa ggtctttctg actgcatccc 5880 

accctgggtc caatcccaca tgtaacc 5907 



<210> 8 
<211> 1466 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 8 






























Met


Ala 


Gly 


Pro 


Cys 


Cys 


Ser 


Pro 


Trp 


Val 


Lys 


Leu 


Leu 


Leu 


Leu 


Ala 


1 








5 










10 










15 




Arg 


Met 


Leu 


Ser 


Ala 


Ser 


Leu 


Pro 


Gly 


Asp 


Leu 


Ala 


Asn 


Arg 


Cys 


Lys 








20 










25 










30






Lys 


Ala 


Gin 


Val 


Lys 


Ser 


Cys 


Thr 


Glu 


Cys 


lie 


Arg 


Val 


Asp 


Lys 


Ser 






35 










40 








45 




cys 


Ala 


Tyr 


Cys 


Thr 


Asp 


Glu 


Leu 


Phe 


Lys 


Glu 


Arg 


Arg 


Cys 


Asn 


Thr 




50 










55 










60
Gln


Ala 


Asp 


Val 


65 






Met 


Glu 


Ser 


Ser 


Leu 


His 


Arg 


Ser 








100 


Arg 


Gly 


Glu 


Glu 






115 




Ser 


Pro 


Val 


Asp 




130 




Asp 


Asd 




A s o 


145 








Leu 


Arg 


Gin 


Leu 


Asp 


Lys 


Val 


Ser 








180 


Glu 


Pro 


Trp 


Pro 






195 




Ser 


Leu 


Thr 


Glu 




210 






Ara 


lie 


Ser 

*-J 


Gl v 


225 








Leu 


Gin 


Thr 


Ala 


Thr 


His 


Leu 


Leu 








260 


Asp 


Gly 


Ala 


Asn 






275 




Cys 


His 


Leu 


Asp 




290 






T vr 


Pro 


Ser 


Val 


305 








lie 


Pro 


lie 


Phe 


Leu 


His 


Lys 


Tyr 








34 0 


Ser 


Ser 


Asn 


j« 






355 




Ser 


Asn 


Leu 


Asp 




37 0' 






Glu 


Val 


Th^ 


Ser 

W» >-L_ 


385 








lie 


Lys 


Arg 


Gly 


Glu 


Asp 


lie 


Asp 








420 


Gly 


Gly 


Asn 


DC 






435 




Asp 


Ala 


Ser 


Val 




4 50 






val 


Arg 


Ser 


Ala 










Cys 


Val 


Cys 


Asn 


Gly 


S er 


Leu 


Ser 








500 


Pro 


Cys 


Ser 


Gly 






515 




Gly 


Glu 


Gly 


Arg 




530 






Cys 


Pro 


Arg 


Thr 


545 









Leu 


Ala 


Ala 


Gly 




70 






XJ U£ 


(2,1 \ i 

vD X U. 


Tic, 


Thr 

X i IX 


85 








Gin 


Val 


Ser 


Pro 


Arg 


Thr 


Phe 


Val 








12 0 


Leu 


Tyr 


3u >*L. Ci^ 


Leu 






135 




Asn 


Leu 


Lys 


Gin 




150 




Thr 

X I 1 X 






iyx. 


165 








Val 


Pro 


Gin 


Thr 


Asn 


Ser 


Asp 


Pro 








200 


Asn 


Val 


Glu 


Glu 






jL ^5 




Asn 


Leu 


Asp 


Ala 




230 






Ma 1 


Gys 


JL L 


7\ *v~ f^r 
MX y 


24 5 








Val 


Phe 


Ser 


Thr 


Val 


Leu 


Ala 


Gly 








280 


Ala 


Ser 


Gly 


Ala 






295 




Pro 


Thr 


Leu 


Val 




310 






jr-i JL <i 


v a J. 


X I LX 


7\ y~\ 


3 2 5 








Phe 


Pro 


Val 


Ser 


Val 


Glu 


Leu 


Leu 








360 


lie 


Arg 


Ala 


Leu 






375 




Asp 


Thr 


Leu 


Gin 




390 






(21 n 


V^ 1 




Thr 

x x ir 


4 05 








Glv 


Thr 


His 


Val 


His 


Leu 


Lys 


Pro 








44 0 


lie 


Cys 


Asp 


Val 






455 




Arg 


Cys 


His 


Phe 




47 0 






Glu 


Gly 


Trp 


Ser 


485 








Asp 


Thr 


Gin 


Pro 


His 


Gly 


Glu 


Cys 








520 


Tyr 


Glu 


Gly 


His 






535 




Ser 


Gly 


Phe 


Leu 




520 










Cys 


Arg 


Gly 


Glu 






75 




Glu 


Asn 


Thr 


Gin 




90 








bay 


jueu 


Pin 

Gin 


105 








Phe 


Gin 


Val 


Phe 


Met 


Asp 


Phe 


Ser 








140 


Met: 


Gly 


Gin 


Asn 






»L *5 ^3 




Thr 


He 


Gly 


Phe 




170 








rite? L 


2i v ?~f 

z^rg 


P y f-\ 

ClU 


185 








Pro 


Phe 


Ser 


Phe 


Phe 


Trp 


Asn 


Lys 








220 


Pro 


Glu 


Gly 


Gly 






235 




Asp 


He 


Gly 


Trp 




250 






(21 i- 


oSx 


7\1 
Z-i_L d 


r ne 


265 








lie 


Met 


Asn 


Arg 


Tyr 


Thr 


Gin 


Tyr 








300 


Arg 


Leu 


Leu 


Ala 






315 




Tyr 


Ser 


Tyr 


Ser 




330 






o sr 


xj©U 


Gly 


v ai 


345 








Glu 


Glu 


Ala 


Phe 


Asp 


Ser 


Pro 


Arg 








38 0 


Lys 


Thr 


Glu 


Thr 






395 




Tyr 


Asn 


Val 


His 




410 








rj n 

will 


XJ u 


nld 


425 








Ser 


Phe 


Ser 


Asp 


Cys 


Pro 


Cys 


Glu 








4 60 


Arg 


Gly 


Asp 


Phe 






475 




Gly 


Lys 


Thr 


Cys 




490 






Cys 


Leu 


Arg 


Glu 


505 








Gin 


Cys 


Gly 


Arg 


Phe 


Cys 


Glu 


Tyr 








540 


Cys 


Asn 


Asp 


Arg 






555 





Ser 


He 


Leu 


Val 








80 


lie 


Val 


Thr 


Ser 






95 




Val 


Arg 


Leu 


Arg 




110 






GlU 


Pro 


Leu 


Glu 


125 








Asn 


Ser 


Met 


Ser 


Leu 


Ala 


Lys 


He 








160 


Gly 


Lys 


Phe 


Val 






175 




Glu 


Lys 


Leu 


Lys 




190 




T, t 7 n 

JjyS 


Son 
as n 


v ai 


ne 


205 








Leu 


Gin 


Gly 


Glu 


Phe 


Asp 


Ala 


He 








240 


Arg 


Ala 


Asp 


Ser 






255 




His 


Tyr 


Glu 


Ala 




27 0 






j\s n 


Asp 


G1U 


Lys 


285 








Lys 


Thr 


Gin 


Asp 


Lys 


His 


Asn 


He 








320 


Tyr 


Tyr 


Glu 


Lys 






335 




Leu 


Gin 


Glu 


Asp 




350 






lyr 


Arg 


lie 


Arg 


365 








Gly 


Leu 


Arg 


Thr 


Gly 


Ser 


Phe 


His 








4 00 


Leu 


Arg 


Ala 


Val 






415 




Lys 


Glu 


Asp 


Gin 




430 






Giy 


Leu 


Arg 


Met 


445 








Leu 


Gin 


Lys 


Glu 


Met 


Cys 


Gly 


His 








4 80 


Asn 


Cys 


Ser 


Thr 






495 




Gly 


Glu 


Asp 


Lys 




510 






Cys 


Val 


Cys 


Tyr 


525 








Asp 


Asn 


Phe 


Gin 


Gly 


Arg 


Cys 


Ser 



560 
Met


Gly 


Glu 


Cys 


Val 


Cys 


Glu 


Pro 


Gly 










565 










Cys 


Pro 


Leu 


Ser 


Asn 


Ala 


Thr 


Cys 


lie 








580 








585 


Asn 


Gly 


Arg 


Gly 


Tyr 


Cys 


Glu 


Cys 


Gly 






595 










600 




Ser 


Leu 


Tyr 


Thr 


Asp 


Thr 


Thr 


Cys 


Glu 




610 










615 






Gly Leu 


Cys 


Glu 


Asp 


Leu 


Arg 


Ser 


Cys 


625 










630 






Thr 


Gly 


Glu 


Lys 


Lys 


Gly 


Arg 


Ala 


Cys 










64 5 










Lys 


Met 


Val 


Asp 


Glu 


Leu 


Lys 


Lys 


Glu 








660 










665 


Phe 


Arg 


Asp 


Glu 


Asp 


Asp 


Asp 


Cys 


Thr 






675 










680 




Asp 


Gly 


Ser 


Pro 


Gly 


Pro 


Asn 


Ser 


Thr 




690 










695 






Asp 


Cys 


Leu 


Pro 


Ala 


Pro 


Sex- 


Trp 


T ro 


7 05 










710 




Leu 


Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Leu 


Leu 










725 










Ala 


Cys 


Cys 


Lys 


Al a 


Cys 


Leu 


Gly 


Leu 








740 










745 


His 


Met 


Val 


Gly 


Phe 


Lys 


Glu 


Asp 


His 






755 










760




Met 


Ala


Ser 


Asp 


His 


Leu 


Asp 




Pro 




770 










775 






Lys 


Gly 


Arg 


Asp 


Thr 


Val 


Arg 


Trp 


Lys 


785 










790 








Pro 


Gly 


Phe 


Ala 


Thr 


His 


Ala 


Ala 


Ser 










805 










Pro 


Tyr 


Gly 


Leu 


Ser 


Leu 


Arg 


Leu 


Gly 








820 










825 




Lys 


Pro 


Gly 


Thr 


Arg 


Glu 


Cys 


Asp 






835 










840




Glu 


Asn 


Leu 


Asn 


Glu 


Val 


Tyr 


Arg 


Gln




850 










855 




Gln


Gln


Thr 


Lys 


Phe 


Arg 


Gln


Gln


Pro 


865










870








His 


Thr 


Ile


Val 


Asp 


Thr 


Val


Leu 


Leu 










885 










Met 


Leu 


Leu 


Lys 


Leu 


Thr 


Glu 


Lys 


Gln








900 








905 


Glu 


Leu 


Lys 


Val 


Ala 


Pro 


Gly 


Tyr


Tyr 






915 










920 




Ala 


Arg 


Gly 


Met 


Val 


Glu 


Phe 


Gln


Glu 




930 










935 






Arg 


Val 


Pro 


Leu 


Phe 




Arg 


Pro 


Glu 


945 










950 






Leu 


Val 


Glu 


Ala 


Ile


Asp 


Val 


Pro 


Val 










965 










Arg 


Leu


Val 


Asn 


Ile


Thr


Ile


Ile


Lys 








980 










985 


Ser 


Phe 


Glu 


Gln


Pro 


Glu 


Tyr 


Ser 


Val 






995 








1000 


Arg 


Ile


Pro 


Val 


Ile


Arg 


His 


Ile


Leu 




1010 








1015 




Ser 


Tyr 


Ser 


Thr 


Gln


Asp 


Asn 


Thr 


Ala 


1025 










1030 






Pro 


Val 


Glu 


Gly 


Glu 


Leu 


Leu 


Phe 


His 



1045 



PCT/US01/32127






Trp 


Thr 


Gly 


Arg 


Ser 


Cys 


Asp 


570










575 




Asp 


Ser 


Asn 


Gly 


Gly 


Ile


Cys 










590 






Arg 


Cys 


His 


Cys 


Asn 


Gln


Gln








605 








Ile


Asn 


Tyr 


Ser 


Ala 


Ile


Leu 






620










Val 


Gln


Cys


Gln


Ala 


Trp 


Gly 




635 










640 


Asp 


Asp 


Cys 


Pro 


Phe 


Lys 


Val 


650 










655




Glu 


Val 


Val 


Glu 


Tyr 


Cys 


Ser 










670






Tyr 


Ser 


Tyr 


Asn 


Val 


Glu 


Gly 








685 








Val 


Leu 


Val 


His 


Lys 


Lys 


Lys 






700 










Leu 


Ile


Pro 


Leu 


Leu 


Ile


Phe 




715 










720 


Leu 


Leu 


Cys 


Trp 


Lys 


Tyr 


Cys 


730 










735 




Leu 


Pro 


Cys 


Cys 


Asn 


Arg 


Gly 










750 






Tyr 


Met 


Leu 


Arg 


Glu 


Asn 


Leu 








765 








Met 


Leu 


Arg 


Ser 


Gly 


Asn 


Leu 






780 










Ile


Thr 


Asn 


Asn 


Val 


Gln


Arg 




795 










800


Thr 


Ser 


Pro 


Thr 


Glu 


Leu 


Val 


810 










815 




Arg 


Leu 


Cys 


Thr 


Glu 


Asn 


Leu 










830 






Gln


Leu 


Arg 


Gln


Glu 


Val 


Glu 








845 








Val 


Ser 


Gly 


Ala 


His 


Lys 


Leu 






860










Asn 


Ala 


Gly 


Lys 


Lys 


Gln


Asp 




875 










880 


Ala 


Pro 


Arg 


Ser 


Ala 


Lys 


Gln


890 










895 




Val 


Glu 


Gln


Gly 


Ser 


Phe 


His 










910 






Thr 


Val 


Thr 


Ala 


Glu 


Gln


Asp 








925 






Gly 


Val 


Glu 


Leu 


Val 


Asp 


Val 






940 








Asp 


Asp 


Asp 


Glu 


Lys 


Gln


Leu 




955 










960 


Ser 


Thr 


Ala 


Thr 


Leu 


Gly 


Arg 


970










975 


Glu 


Gln


Ala 


Ser 


Gly 


Val 


Val 










990 






Ser 


Arg 


Gly 


Asp 


Gln


Val 


Ala 








1005 






Asp 


Asn 


Gly 


Lys 


Ser 


Gln


Val 






1020 








His 


Gly 


His 


Arg 


Asp 


Tyr 


Val 




1035 








1040 


Pro 


Gly 


Glu 


Thr 


Trp 


Lys 


Glu 


1050 








1055 
Leu Gln Val Lys Leu Leu Glu Leu Gln Glu Val Asp Ser Leu Leu Arg

1060 1065 1070

Gly Arg Gln Val Arg Arg Phe Gln Val Gln Leu Ser Asn Pro Lys Phe

1075 1080 1085

Gly Ala Arg Leu Gly Gln Pro Ser Thr Thr Thr Val Ile Leu Asp Glu

1090 1095 1100

Thr Asp Arg Ser Leu Ile Asn Gln Thr Leu Ser Ser Pro Pro Pro Pro

1105 1110 1115 1120

His Gly Asp Leu Gly Ala Pro Gln Asn Pro Asn Ala Lys Ala Ala Gly

1125 1130 1135

Ser Arg Lys Ile His Phe Asn Trp Leu Pro Pro Pro Gly Lys Pro Met

1140 1145 1150

Gly Tyr Arg Val Lys Tyr Trp Ile Gln Gly Asp Ser Glu Ser Glu Ala

1155 1160 1165

His Leu Leu Asp Ser Lys Val Pro Ser Val Glu Leu Thr Asn Leu Tyr

1170 1175 1180

Pro Tyr Cys Asp Tyr Glu Met Lys Val Cys Ala Tyr Gly Ala Lys Gly

1185 1190 1195 1200

Glu Gly Pro Tyr Ser Ser Leu Val Ser Cys Arg Thr His Gln Glu Val

1205 1210 1215

Pro Ser Glu Pro Gly Arg Leu Ala Phe Asn Val Val Ser Ser Thr Val

1220 1225 1230

Thr Gln Leu Ser Trp Ala Glu Pro Ala Glu Thr Asn Gly Glu Ile Thr

1235 1240 1245

Ala Tyr Glu Val Cys Tyr Gly Leu Val Asn Glu Asp Asn Arg Pro Ile

1250 1255 1260

Gly Pro Met Lys Lys Val Leu Val Asp Asn Pro Lys Asn Arg Met Leu

1265 1270 1275 1280

Leu Ile Glu Asn Leu Arg Asp Ser Gln Pro Tyr Arg Tyr Thr Val Lys

1285 1290 1295

Ala Arg Asn Gly Ala Gly Trp Gly Pro Glu Arg Glu Ala Ile Ile Asn

1300 1305 1310

Leu Ala Thr Gln Pro Lys Arg Pro Met Ser Ile Pro Ile Ile Pro Asp

1315 1320 1325

Ile Pro Ile Val Asp Ala Gln Gly Gly Glu Asp Tyr Glu Asn Phe Leu

1330 1335 1340

Met Tyr Ser Asp Asp Val Leu Arg Ser Pro Ala Ser Ser Gln Arg Pro

1345 1350 1355 1360

Ser Val Ser Asp Asp Thr Glu His Leu Val Asn Gly Arg Met Asp Phe

1365 1370 1375

Ala Tyr Pro Gly Ser Ala Asn Ser Leu His Arg Met Thr Ala Ala Asn

1380 1385 1390

Val Ala Tyr Gly Thr His Leu Ser Pro His Leu Ser His Arg Val Leu

1395 1400 1405

Ser Thr Ser Ser Thr Leu Thr Arg Asp Tyr His Ser Leu Thr Arg Thr

1410 1415 1420

Glu His Ser His Ser Gly Thr Leu Pro Arg Asp Tyr Ser Thr Leu Thr

1425 1430 1435 1440

Ser Leu Ser Ser Gln Ala Ser Leu Leu Ser Gly Lys Met Gly Gly Ala

1445 1450 1455

Gly Phe Arg Cys Pro Gly Leu Leu Gly Pro

1460 1465



<210> 9
<211> 2205
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence; Note =
synthetic construct
<400> 9

atggcagggc cctgttgcag cccatgggtg aagctgctgc tgctggcacg aatgctgagt 60 

gccagcctcc ctggagacct ggccaaccgc tgcaagaagg ctcaggtgaa gagctgtacc 120 

gagtgcatcc gggtggacaa gagctgtgcc tactgcacag acgagctgtt caaggagagg 180 

cgctgcaaca cccaggcgga cgttctggct gcaggctgca ggggagagag catcctggtc 240

atggagagca gccttgaaat cacagagaac acccagatcg tcaccagcct gcaccgcagc 300 

caggtatctc cccaaggcct gcaagtccgg ctgcggcggg gtgaggagcg cacgtttgtg 360 

ttccaggtct ttgagcccct ggagagcccc gtggatctgt atatcctcat ggacttctcc 420 

aactccatgt ctgacgatct ggacaacctc aagcagatgg ggcagaacct ggccaagatc 480 

ctgcgccagc tcaccagcga ctacaccatt ggatttggaa agtttgtgga caaagtcagc 540 

gtcccacaga cagacatgag gcccgagaaa ctgaaggagc cctggcccaa cagtgatccc 600 

ccgttctcct tcaagaacgt tatcagctta acggagaatg tggaagaatt ctggaacaaa 660

ctgcaaggag aacgcatctc aggcaacctg gacgctcctg aagggggctt tgatgccatc 720 

ctgcagacag ctgtgtgcac aagggacatt ggctggaggg ctgacagcac ccacctgctg 780 

gtgttctcca ccgagtctgc cttccactac gaggctgatg gtgccaacgt tctggccggc 840 

atcatgaacc gcaatgatga gaaatgccac ctggacgcct cgggcgccta cacccaatac 900 

aagacacagg actacccatc agtgcccacg ctggttcgcc tgcttgccaa gcataacatc 960 

atccccatct ttgctgtcac caactactct tacagctact atgagaagct ccataagtat 1020 

ttccccgtct cctctctggg cgtcctgcag gaggattcat ccaacatcgt ggagctgctg 1080 

gaggaggcct tctatcgaat tcgctccaac ctggacatcc gggctctgga cagccccaga 1140

ggcctgagaa cagaggtcac ctccgatact ctccagaaga cggagactgg gtcctttcac 1200 

atcaagcggg gggaagtggg cacatacaat gtgcatctcc gggcagtgga ggacatagat 1260 

gggacacatg tgtgccagct ggctaaagaa gaccaagggg gcaacatcca cctgaaaccc 1320 

tccttctctg atggcctccg gatggacgcg agtgtgatct gtgacgtgtg cccctgtgag 1380 

ctgcaaaagg aagttcgatc agctcgctgt cacttcagag gagacttcat gtgtggacac 1440 

tgtgtgtgca atgagggctg gagtggcaaa acctgcaact gctccaccgg ctctctgagt 1500 

gacacacagc cctgcctgcg tgagggtgag gacaaaccgt gctcgggcca cggcgagtgc 1560 

cagtgcggac gctgtgtgtg ctatggtgaa ggccgctacg agggtcactt ctgcgagtat 1620 

gacaacttcc agtgtccccg gacctctgga ttcctgtgca atgaccgggg acgctgttct 1680 

atgggagagt gtgtgtgtga gcctggttgg acaggccgca gctgcgactg tcccctcagc 1740 

aatgccacct gcatcgatag caacgggggc atctgcaacg gccgaggcta ctgtgagtgt 1800 

ggccgttgtc actgcaacca gcagtcgctc tacacggaca ccacctgtga gatcaactac 1860

tctgcgatac tgggtctctg tgaggatctc cgctcctgcg tacagtgcca ggcctggggc 1920 

accggggaga agaaagggcg cgcgtgtgac gattgcccct ttaaagtcaa gatggtagac 1980 

gagcttaaga aagaagaggt ggtggagtac tgctccttcc gggatgagga tgacgactgc 2040

acttacagct acaacgtgga gggcgacggc agccctgggc ccaacagcac agtcctggtc 2100

cacaaaaaga aagactgcct cccggctcct tcctggtggc tcatccccct gctcatcttc 2160

ctcctgttgc tcctggcgtt gcttctgctg ctctgctgga aatga 2205 



<210> 10 
<211> 734 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 10



Met Ala Gly Pro Cys Cys Ser Pro Trp Val Lys Leu Leu Leu Leu Ala

1 5 10 15

Arg Met Leu Ser Ala Ser Leu Pro Gly Asp Leu Ala Asn Arg Cys Lys

20 25 30

Lys Ala Gln Val Lys Ser Cys Thr Glu Cys Ile Arg Val Asp Lys Ser

35 40 45

Cys Ala Tyr Cys Thr Asp Glu Leu Phe Lys Glu Arg Arg Cys Asn Thr

50 55 60

Gln Ala Asp Val Leu Ala Ala Gly Cys Arg Gly Glu Ser Ile Leu Val

65 70 75 80

Met Glu Ser Ser Leu Glu Ile Thr Glu Asn Thr Gln Ile Val Thr Ser

85 90 95
Leu His Arg Ser Gln Val Ser Pro Gln Gly Leu Gln Val Arg Leu Arg

100 105 110

Arg Gly Glu Glu Arg Thr Phe Val Phe Gln Val Phe Glu Pro Leu Glu

115 120 125

Ser Pro Val Asp Leu Tyr Ile Leu Met Asp Phe Ser Asn Ser Met Ser

130 135 140

Asp Asp Leu Asp Asn Leu Lys Gln Met Gly Gln Asn Leu Ala Lys Ile

145 150 155 160

Leu Arg Gln Leu Thr Ser Asp Tyr Thr Ile Gly Phe Gly Lys Phe Val

165 170 175

Asp Lys Val Ser Val Pro Gln Thr Asp Met Arg Pro Glu Lys Leu Lys

180 185 190

Glu Pro Trp Pro Asn Ser Asp Pro Pro Phe Ser Phe Lys Asn Val Ile

195 200 205

Ser Leu Thr Glu Asn Val Glu Glu Phe Trp Asn Lys Leu Gln Gly Glu

210 215 220

Arg Ile Ser Gly Asn Leu Asp Ala Pro Glu Gly Gly Phe Asp Ala Ile

225 230 235 240

Leu Gln Thr Ala Val Cys Thr Arg Asp Ile Gly Trp Arg Ala Asp Ser

245 250 255

Thr His Leu Leu Val Phe Ser Thr Glu Ser Ala Phe His Tyr Glu Ala

260 265 270

Asp Gly Ala Asn Val Leu Ala Gly Ile Met Asn Arg Asn Asp Glu Lys

275 280 285

Cys His Leu Asp Ala Ser Gly Ala Tyr Thr Gln Tyr Lys Thr Gln Asp

290 295 300

Tyr Pro Ser Val Pro Thr Leu Val Arg Leu Leu Ala Lys His Asn Ile

305 310 315 320

Ile Pro Ile Phe Ala Val Thr Asn Tyr Ser Tyr Ser Tyr Tyr Glu Lys

325 330 335

Leu His Lys Tyr Phe Pro Val Ser Ser Leu Gly Val Leu Gln Glu Asp

340 345 350

Ser Ser Asn Ile Val Glu Leu Leu Glu Glu Ala Phe Tyr Arg Ile Arg

355 360 365

Ser Asn Leu Asp Ile Arg Ala Leu Asp Ser Pro Arg Gly Leu Arg Thr

370 375 380

Glu Val Thr Ser Asp Thr Leu Gln Lys Thr Glu Thr Gly Ser Phe His

385 390 395 400

Ile Lys Arg Gly Glu Val Gly Thr Tyr Asn Val His Leu Arg Ala Val

405 410 415

Glu Asp Ile Asp Gly Thr His Val Cys Gln Leu Ala Lys Glu Asp Gln

420 425 430

Gly Gly Asn Ile His Leu Lys Pro Ser Phe Ser Asp Gly Leu Arg Met

435 440 445

Asp Ala Ser Val Ile Cys Asp Val Cys Pro Cys Glu Leu Gln Lys Glu

450 455 460

Val Arg Ser Ala Arg Cys His Phe Arg Gly Asp Phe Met Cys Gly His

465 470 475 480

Cys Val Cys Asn Glu Gly Trp Ser Gly Lys Thr Cys Asn Cys Ser Thr

485 490 495

Gly Ser Leu Ser Asp Thr Gln Pro Cys Leu Arg Glu Gly Glu Asp Lys

500 505 510

Pro Cys Ser Gly His Gly Glu Cys Gln Cys Gly Arg Cys Val Cys Tyr

515 520 525

Gly Glu Gly Arg Tyr Glu Gly His Phe Cys Glu Tyr Asp Asn Phe Gln

530 535 540

Cys Pro Arg Thr Ser Gly Phe Leu Cys Asn Asp Arg Gly Arg Cys Ser

545 550 555 560

Met Gly Glu Cys Val Cys Glu Pro Gly Trp Thr Gly Arg Ser Cys Asp

565 570 575

Cys Pro Leu Ser Asn Ala Thr Cys Ile Asp Ser Asn Gly Gly Ile Cys

580 585 590
Asn Gly Arg Gly Tyr Cys Glu Cys Gly Arg Cys His Cys Asn Gln Gln

595 600 605

Ser Leu Tyr Thr Asp Thr Thr Cys Glu Ile Asn Tyr Ser Ala Ile Leu

610 615 620

Gly Leu Cys Glu Asp Leu Arg Ser Cys Val Gln Cys Gln Ala Trp Gly

625 630 635 640

Thr Gly Glu Lys Lys Gly Arg Ala Cys Asp Asp Cys Pro Phe Lys Val

645 650 655

Lys Met Val Asp Glu Leu Lys Lys Glu Glu Val Val Glu Tyr Cys Ser

660 665 670

Phe Arg Asp Glu Asp Asp Asp Cys Thr Tyr Ser Tyr Asn Val Glu Gly

675 680 685

Asp Gly Ser Pro Gly Pro Asn Ser Thr Val Leu Val His Lys Lys Lys

690 695 700

Asp Cys Leu Pro Ala Pro Ser Trp Trp Leu Ile Pro Leu Leu Ile Phe

705 710 715 720

Leu Leu Leu Leu Leu Ala Leu Leu Leu Leu Leu Cys Trp Lys

725 730









<210> 11 
<211> 3579 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; Note =
synthetic construct 



<400> 11 

atgcctgcgc tctggctcag ctgctgcctc ggtgtcgcgc tcctgctgcc cgccagccag 60 

gccacctcca ggagggaagt ctgtgattgc aatgggaagt ccaggcaatg tgtctttgat 120 

caggagctcc atcgacaagc aggcagcggg ttccgttgcc tcaactgcaa tgacaataca 180 

gcgggggttc actgcgagcg gtcgagggag gggttttacc agcatcagag caagagccgc 240 

tgcctaccct gcaactgcca ctcaaagggt tccctcagtg ctggatgtga caactctgga 300 

caatgcaggt gtaagccagg tgtgacagga caaagatgtg accagtgtca gccaggcttc 360

catatgctca ccgatgctgg atgcacccga gaccaggggc aactagattc caagtgtgac 420

tgtgacccag ctggcatctc tggaccctgt gattctggcc gatgtgtctg caaaccagcc 480 

gtcactggag agcgctgtga taggtgccga ccacgtgact atcatctgga ccgggcaaac 540 

cctgagggct gtacccagtg tttctgctat gggcattcag ccagctgcca cgcctctgcc 600 

gacttcagtg tccacaaaat cacttcaact ttcagtcagg atgtggatgg ttggaaggcg 660 

gttcagagaa acggggcacc tgcaaaactc cactggtcac agcgccatcg ggacgtgttt 720 

agttctgccc gaagatcaga ccccgtctat ttcgtggccc ctgccaaatt cctcggtaac 780

cagcaagtga gttacgggca gagcctgtct tttgactacc gcgtggacag aggaggtaga 840

cagccgtctg cctacgatgt gatcctggaa ggtgctggtc tacagatcag agctcctctg 900 

atggctccag gcaagacact tccttgtggg atcacaaaga cttacacatt cagactgaat 960

gaacatccaa gcagtcactg gagtccccag ctgagttatt tcgaatatcg aaggttactg 1020 

cggaacctca cagccctcct gatgatccga gctacgtacg gagaatatag tacagggtac 1080

attgataacg tgaccctggt ttcagcccgc cctgtccttg gagccccagc cccttgggtt 1140

gaacgttgtg tatgcctgct ggggtacaag ggacaattct gccagcaatg tgcttctggt 1200

tacaaaagag attcggcaag attgggcgct tttggcgcct gtgttccctg haactgccaa 1260 

ggggaggggg cctgtgatcc agacacggga gattgctact cgggggacga gaatcctgac 1320 

attgagtgtg ctgactgtcc catcggtttc tacaatgacc cacatgaccc ccgcagctgc 1380 

aagccatgtc cctgtcacaa tgggttcagc tgttcagtga tgcctgagac agaggaggtg 1440

gtgtgtaaca actgtccccc tggggtcaca ggtgcccgct gtgagctctg tgctgatggc 1500 

ttctttgggg atccctttgg ggaacatggc ccagtgaggc cttgtcaacg ctgccaatgc 1560

aacaacaacg tggaccccaa tgcctctggg aactgtgacc agttgacagg cagatgcttg 1620

aaatgtatct acaacacggc cggtgtctac tgtgaccagt gcaaagcagg ttactttgga 1680

gacccattgg ctcccaaccc agcagacaag tgtcgagctt gcaactgcag ccccatgggt 1740

gcggagcctg gagagtgtcg aggtgatggc agctgtgttt gcaagccagg ctttggcgcc 1800 

ttcaactgtg atcacgcagc cctaaccagt tgtcctgctt cctacaatca agtgaagatt 1860

cagatggacc agtttaccca gcagctccag agcctggagg ccctggtttc aaaggctcag 1920

ggtggtggtg gtggtggtac agtcccagtg cagctggaag gcaggatcga gcaggctgag 1980

caggcccttc aggacattct gggagaagct cagatttcag aaggggcaat gagagccgtt 2040

gctgtccggc tggccaaggc aaggagccaa gagaacgact acaagacccg cctggatgac 2100 

ctcaagatga ctgcagaaag gatccgggcc ctgggcagtc agcatcagaa cagagttcag 2160 

gatacgagca gactcatctc tcagatgcgc ctgagtctgg caggaagcga agctctcttg 2220

gaaaacacta atatccattc ttctgagcac tacgtggggc cgaatgattt taaaagtctg 2280

gctcaggagg ctacaagaaa ggcagacagc cacgctgagt cagctaacgc aatgaagcaa 2340 

ctagcaaggg aaactgagga ctactccaaa caagcacttt cattggcccg caagctcttg 2400 

agtggaggag gcggaagtgg ctcttgggac agctccgtgg tacaaggtct tatgggaaaa 2460 

ttagagaaaa ccaagtccct gagccagcag ctgtcattgg agggcaccca agccgacatt 2520

gaagctgata ggtcgtatca gcacagtctc cgcctcctgg attctgcctc tcagcttcag 2580 

ggagtcagtg atctgtcctt tcaggtggaa gcaaagagga tcagacaaaa ggctgattct 2640 
ctctcaaacc tggtgaccag acaaacggat gcattcacgc gtgtgcgaaa caatctgggg 2700

aactgggaaa aagaaacacg gcagctttta cagactggaa aggataggag acagacttca 2760

gatcagctgc tttcccgtgc caaccttgct aaaaacagag cccaagaagc gctaagtatg 2820 

ggcaatgcca ctttttatga agttgagaac atcctgaaga acctccgaga gtttgatctg 2880 

caggttgaag acagaaaagc agaggctgaa gaggccatga agagactctc ctctattagc 2940 

cagaaggttg cggatgccag tgacaagacc cagcaagcag aaacggccct ggggagcgcc 3000 

actgccgaca cccaacgggc aaagaacgca gctagggagg ccctggagat cagcagcgag 3060 

atagagctgg agatagggag tctgaacttg gaagctaatg tgacagcaga tggggccttg 3120 

gccatggaga aagggactgc cactctgaag agcgagatga gagagatgat tgagctggcc 3180 

agaaaggagc tggagtttga cacggataag gacacggtgc agctggtgat tactgaagcc 3240

cagcaagctg atgccagagc cacgagtgcc ggagttacca tccaagacac rctcaacaca 3300 

ttggacggca tcctacacct catagaccag cctggcagtg tggatgaaga agggatgatg 3360 

ctattagaac aagggctttt ccaagccaag acccagatca acagtcgact tcggcccttg 3420 

atgtctgacc tggaggagag ggtgcgtcgg cagaggaacc acctccatct gctggagact 3480 

agcatagatg gaattcttgc tgatgtgaag aacctggaga acattcgaga caacctgccc 3540 

ccaggctgct acaataccca agctcttgag caacagtga 3579 



<210> 12 
<211> 1192
<212> PRT

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 12 





Met Pro Ala Leu Trp Leu Ser Cys Cys Leu Gly Val Ala Leu Leu Leu

1 5 10 15

Pro Ala Ser Gln Ala Thr Ser Arg Arg Glu Val Cys Asp Cys Asn Gly

20 25 30

Lys Ser Arg Gln Cys Val Phe Asp Gln Glu Leu His Arg Gln Ala Gly

35 40 45

Ser Gly Phe Arg Cys Leu Asn Cys Asn Asp Asn Thr Ala Gly Val His

50 55 60

Cys Glu Arg Ser Arg Glu Gly Phe Tyr Gln His Gln Ser Lys Ser Arg

65 70 75 80

Cys Leu Pro Cys Asn Cys His Ser Lys Gly Ser Leu Ser Ala Gly Cys

85 90 95

Asp Asn Ser Gly Gln Cys Arg Cys Lys Pro Gly Val Thr Gly Gln Arg

100 105 110

Cys Asp Gln Cys Gln Pro Gly Phe His Met Leu Thr Asp Ala Gly Cys

115 120 125

Thr Arg Asp Gln Gly Gln Leu Asp Ser Lys Cys Asp Cys Asp Pro Ala

130 135 140

Gly Ile Ser Gly Pro Cys Asp Ser Gly Arg Cys Val Cys Lys Pro Ala

145 150 155 160

Val Thr Gly Glu Arg Cys Asp Arg Cys Arg Pro Arg Asp Tyr His Leu

165 170 175

Asp Arg Ala Asn Pro Glu Gly Cys Thr Gln Cys Phe Cys Tyr Gly His

180 185 190
Ser Ala Ser Cys His Ala Ser Ala Asp Phe Ser Val His Lys Ile Thr

195 200 205

Ser Thr Phe Ser Gln Asp Val Asp Gly Trp Lys Ala Val Gln Arg Asn

210 215 220

Gly Ala Pro Ala Lys Leu His Trp Ser Gln Arg His Arg Asp Val Phe

225 230 235 240

Ser Ser Ala Arg Arg Ser Asp Pro Val Tyr Phe Val Ala Pro Ala Lys

245 250 255

Phe Leu Gly Asn Gln Gln Val Ser Tyr Gly Gln Ser Leu Ser Phe Asp

260 265 270

Tyr Arg Val Asp Arg Gly Gly Arg Gln Pro Ser Ala Tyr Asp Val Ile

275 280 285

Leu Glu Gly Ala Gly Leu Gln Ile Arg Ala Pro Leu Met Ala Pro Gly

290 295 300

Lys Thr Leu Pro Cys Gly Ile Thr Lys Thr Tyr Thr Phe Arg Leu Asn

305 310 315 320

Glu His Pro Ser Ser His Trp Ser Pro Gln Leu Ser Tyr Phe Glu Tyr

325 330 335

Arg Arg Leu Leu Arg Asn Leu Thr Ala Leu Leu Met Ile Arg Ala Thr

340 345 350

Tyr Gly Glu Tyr Ser Thr Gly Tyr Ile Asp Asn Val Thr Leu Val Ser

355 360 365

Ala Arg Pro Val Leu Gly Ala Pro Ala Pro Trp Val Glu Arg Cys Val

370 375 380

Cys Leu Leu Gly Tyr Lys Gly Gln Phe Cys Gln Glu Cys Ala Ser Gly

385 390 395 400

Tyr Lys Arg Asp Ser Ala Arg Leu Gly Ala Phe Gly Ala Cys Val Pro

405 410 415

Cys Asn Cys Gln Gly Glu Gly Ala Cys Asp Pro Asp Thr Gly Asp Cys

420 425 430

Tyr Ser Gly Asp Glu Asn Pro Asp Ile Glu Cys Ala Asp Cys Pro Ile

435 440 445

Gly Phe Tyr Asn Asp Pro His Asp Pro Arg Ser Cys Lys Pro Cys Pro

450 455 460

Cys His Asn Gly Phe Ser Cys Ser Val Met Pro Glu Thr Glu Glu Val

465 470 475 480

Val Cys Asn Asn Cys Pro Pro Gly Val Thr Gly Ala Arg Cys Glu Leu

485 490 495

Cys Ala Asp Gly Phe Phe Gly Asp Pro Phe Gly Glu His Gly Pro Val

500 505 510

Arg Pro Cys Gln Arg Cys Gln Cys Asn Asn Asn Val Asp Pro Asn Ala

515 520 525

Ser Gly Asn Cys Asp Gln Leu Thr Gly Arg Cys Leu Lys Cys Ile Tyr

530 535 540

Asn Thr Ala Gly Val Tyr Cys Asp Gln Cys Lys Ala Gly Tyr Phe Gly

545 550 555 560

Asp Pro Leu Ala Pro Asn Pro Ala Asp Lys Cys Arg Ala Cys Asn Cys

565 570 575

Ser Pro Met Gly Ala Glu Pro Gly Glu Cys Arg Gly Asp Gly Ser Cys

580 585 590

Val Cys Lys Pro Gly Phe Gly Ala Phe Asn Cys Asp His Ala Ala Leu

595 600 605

Thr Ser Cys Pro Ala Cys Tyr Asn Gln Val Lys Ile Gln Met Asp Gln

610 615 620

Phe Thr Gln Gln Leu Gln Ser Leu Glu Ala Leu Val Ser Lys Ala Gln

625 630 635 640

Gly Gly Gly Gly Gly Gly Thr Val Pro Val Gln Leu Glu Gly Arg Ile

645 650 655

Glu Gln Ala Glu Gln Ala Leu Gln Asp Ile Leu Gly Glu Ala Gln Ile

660 665 670

Ser Glu Gly Ala Met Arg Ala Val Ala Val Arg Leu Ala Lys Ala Arg

675 680 685
Ser Gln Glu Asn Asp Tyr Lys Thr Arg Leu Asp Asp Leu Lys Met Thr

690 695 700

Ala Glu Arg Ile Arg Ala Leu Gly Ser Gln His Gln Asn Arg Val Gln

705 710 715 720

Asp Thr Ser Arg Leu Ile Ser Gln Met Arg Leu Ser Leu Ala Gly Ser

725 730 735

Glu Ala Leu Leu Glu Asn Thr Asn Ile His Ser Ser Glu His Tyr Val

740 745 750

Gly Pro Asn Asp Phe Lys Ser Leu Ala Gln Glu Ala Thr Arg Lys Ala

755 760 765

Asp Ser His Ala Glu Ser Ala Asn Ala Met Lys Gln Leu Ala Arg Glu

770 775 780

Thr Glu Asp Tyr Ser Lys Gln Ala Leu Ser Leu Ala Arg Lys Leu Leu

785 790 795 800

Ser Gly Gly Gly Gly Ser Gly Ser Trp Asp Ser Ser Val Val Gln Gly

805 810 815

Leu Met Gly Lys Leu Glu Lys Thr Lys Ser Leu Ser Gln Gln Leu Ser

820 825 830

Leu Glu Gly Thr Gln Ala Asp Ile Glu Ala Asp Arg Ser Tyr Gln His

835 840 845

Ser Leu Arg Leu Leu Asp Ser Ala Ser Gln Leu Gln Gly Val Ser Asp

850 855 860

Leu Ser Phe Gln Val Glu Ala Lys Arg Ile Arg Gln Lys Ala Asp Ser

865 870 875 880

Leu Ser Asn Leu Val Thr Arg Gln Thr Asp Ala Phe Thr Arg Val Arg

885 890 895

Asn Asn Leu Gly Asn Trp Glu Lys Glu Thr Arg Gln Leu Leu Gln Thr

900 905 910

Gly Lys Asp Arg Arg Gln Thr Ser Asp Gln Leu Leu Ser Arg Ala Asn

915 920 925

Leu Ala Lys Asn Arg Ala Gln Glu Ala Leu Ser Met Gly Asn Ala Thr

930 935 940

Phe Tyr Glu Val Glu Asn Ile Leu Lys Asn Leu Arg Glu Phe Asp Leu

945 950 955 960

Gln Val Glu Asp Arg Lys Ala Glu Ala Glu Glu Ala Met Lys Arg Leu

965 970 975

Ser Ser Ile Ser Gln Lys Val Ala Asp Ala Ser Asp Lys Thr Gln Gln

980 985 990

Ala Glu Thr Ala Leu Gly Ser Ala Thr Ala Asp Thr Gln Arg Ala Lys

995 1000 1005

Asn Ala Ala Arg Glu Ala Leu Glu Ile Ser Ser Glu Ile Glu Leu Glu

1010 1015 1020

Ile Gly Ser Leu Asn Leu Glu Ala Asn Val Thr Ala Asp Gly Ala Leu

1025 1030 1035 1040

Ala Met Glu Lys Gly Thr Ala Thr Leu Lys Ser Glu Met Arg Glu Met

1045 1050 1055

Ile Glu Leu Ala Arg Lys Glu Leu Glu Phe Asp Thr Asp Lys Asp Thr

1060 1065 1070

Val Gln Leu Val Ile Thr Glu Ala Gln Gln Ala Asp Ala Arg Ala Thr

1075 1080 1085

Ser Ala Gly Val Thr Ile Gln Asp Thr Leu Asn Thr Leu Asp Gly Ile

1090 1095 1100

Leu His Leu Ile Asp Gln Pro Gly Ser Val Asp Glu Glu Gly Met Met

1105 1110 1115 1120

Leu Leu Glu Gln Gly Leu Phe Gln Ala Lys Thr Gln Ile Asn Ser Arg

1125 1130 1135

Leu Arg Pro Leu Met Ser Asp Leu Glu Glu Arg Val Arg Arg Gln Arg

1140 1145 1150

Asn His Leu His Leu Leu Glu Thr Ser Ile Asp Gly Ile Leu Ala Asp

1155 1160 1165

Val Lys Asn Leu Glu Asn Ile Arg Asp Asn Leu Pro Pro Gly Cys Tyr

1170 1175 1180

Asn Thr Gln Ala Leu Glu Gln Gln

1185 1190

<210> 13 
<211> 3582
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 13

atgcctgcgc tctggctggg ctgctgcctc tgcttctcgc tcctcctgcc cgcagcccgg 60 

gccacctcca ggagggaagt ctgtgattgc aatgggaagt ccaggcagtg tatctttgat 120

cgggaacttc acagacaaac tggtaatgga ttccgctgcc tcaactgcaa tgacaacact 180 

gatggcattc actgcgagaa gtgcaagaat ggcttttacc ggcacagaga aagggaccgc 240 

tgtttgccct gcaattgtaa ctccaaaggt tctcttagtg ctcgatgtga caactctgga 300

cggtgcagct gtaaaccagg tgtgacagga gccagatgcg accgatgtct gccaggcttc 360

cacatgctca cggatgcggg gtgcacccaa gaccagagac tgctagactc caagtgtgac 420 

tgtgacccag ctggcatcgc agggccctgt gacgcgggcc gctgtgtctg caagccagct 480

gttactggag aacgctgtga taggtgtcga tcaggttact ataatctgga tggggggaac 540

cctgagggct gtacccagtg tttctgctat gggcattcag ccagctgccg cagctctgca 600 

gaatacagtg tccataagat cacctctacc tttcatcaag atgttgatgg ctggaaggct 660 

gtccaacgaa atgggtctcc tgcaaagctc caatggtcac agcgccatca agatgtgttt 720 

agctcagccc aacgactaga ccctgtctat tttgtggctc ctgccaaatt tcttgggaat 780 

caacaggtga gctatgggca aagcctgtcc tttgactacc gtgtggacag aggaggcaga 840 

cacccatctg cccatgatgt gattctggaa ggtgctggtc tacggatcac agctcccttg 900 

atgccacttg gcaagacact gccttgtggg ctcaccaaga cttacacatt caggttaaat 960 

gagcatccaa gcaataattg gagcccccag ctgagttact ttgagtatcg aaggttactg 1020 

cggaatctca cagccctccg catccgagct acatatggag aatacagtac tgggtacatt 1080 

gacaatgtga ccctgatttc agcccgccct gtctctggag ccccagcacc ctgggttgaa 1140

cagtgtatat gtcctgttgg gtacaagggg caattctgcc aggattgtgc ttctggctac 1200 

aagagagatt cagcgagact ggggcctttt ggcacctgta ttccttgtaa ctgtcaaggg 1260 

ggaggggcct gtgatccaga cacaggagat tgttattcag gggatgagaa tcctgacatt 1320 

gagtgtgctg actgcccaat tggtttctac aacgatccgc acgacccccg cagctgcaag 1380

ccatgtccct gtcataacgg gttcagctgc tcagtgatgc cggagacgga ggaggtggtg 1440

tgcaataact gccctcccgg ggtcaccggt gcccgctgtg agctctgtgc tgatggctac 1500

tttggggacc cctttggtga acatggccca gtgaggcctt gtcagccctg tcaatgcaac 1560

aacaatgtgg accccagtgc ctctgggaat tgtgaccggc tgacaggcag gtgtttgaag 1620

tgtatccaca acacagccgg catctactgc gaccagtgca aagcaggcta cttcggggac 1680

ccattggctc ccaacccagc agacaagtgt cgagcttgca actgtaaccc catgggctca 1740

gagcctgtag gatgtcgaag tgatggcacc tgtgtttgca agccaggatt tggtggcccc 1800

aactgtgagc atggagcatt cagctgtcca gcttgctata atcaagtgaa gattcagatg 1860

gatcagttta tgcagcagct tcagagaatg gaggccctga tttcaaaggc tcagggtggt 1920 

gatggagtag tacctgatac agagctggaa ggcaggatgc agcaggctga gcaggccctt 1980 

caggacattc tgagagatgc ccagatttca gaaggtgcta gcagatccct tggtctccag 2040 

ttggccaagg tgaggagcca agagaacagc taccagagcc gcctggatga cctcaagatg 2100 

actgtggaaa gagttcgggc tctgggaagt cagtaccaga accgagttcg ggatactcac 2160 

aggctcatca ctcagatgca gctgagcctg gcagaaagtg aagcttcctt gggaaacact 2220 

aacattcctg cctcagacca ctacgtgggg ccaaatggct ttaaaagtct ggctcaggag 2280

gccacaagat tagcagaaag ccacgttgag tcagccagta acatggagca actgacaagg 2340 

gaaactgagg actattccaa acaagccctc tcactggtgc gcaaggccct gcatgaagga 2400

gtcggaagcg gaagcggtag cccggacggt gctgtggtgc aagggcttgt ggaaaaattg 2460 

gagaaaacca agtccctggc ccagcagttg acaagggagg ccactcaagc ggaaattgaa 2520

gcagataggt cttatcagca cagtctccgc ctcctggatt cagtgtctcg gcttcaggca 2580

gtcagtgatc agtcctttca ggtggaagaa gcaaagagga tcaaacaaaa agcggattca 2640

ctctcaacgc tggtaaccag gcatatggat gagttcaagc gtacacaaaa gaatctggga 2700

aactggaaag aagaagcaca gcagctctta cagaatggaa aaagtgggag agagaaatca 2760

gatcagctgc tttcccgtgc caatcttgct aaaagcagag cacaagaagc actgagtatg 2820

ggcaatgcca ctttttatga acttgagagc atccttaaaa acctcagaga gtttgacctg 2880

caggtggaca acagaaaagc agaagctgaa gaagccatga agagactctc ctacatcagc 2940 
cagaaggttt cagatgccag tgacaagacc cagcaagcag aaagagccct ggggagcgct 3000

gctgctgatg cacagagggc aaagaatggg gccggggagg ccctggaaat ctccagtgag 3060 

attgaacagg agattgggag tctgaacttg gaagccaatg tgacagcaga tggagccttg 3120

gccatggaaa agggactggc ctctctgaag agtgagatga gggaagtgga aggagagctg 3180

gaaaggaagg agctggagtt tgacacgaat atggatgcag tacagatggt gattacagaa 3240 

gcccagaagg ttgataccag agccaagaac gctggggtta caatccaaga cacactcaac 3300 

acattagacg gcctcctgca tctgatggac cagcctctca gtgtagatga agaggggctg 3360

gtcttactgg agcagaagct ttcccgagcc aagacccaga tcaacagcca actgcggccc 3420

atgatgtcag agctggaaga gagggcacgt cagcagaggg gccacctcca tttgctggag 3480 

acaagcatag atgggattct ggctgatgtg aagaacttgg agaacattag ggacaacctg 3540 

cccccaggct gctacaatac ccaggctctt gagcaacagt ga 3582 

<210> 14 
<211> 1193 
<212> PRT 

<213> Artificial Sequence 



<220>

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 14 



Met Pro Ala Leu Trp Leu Gly Cys Cys Leu Cys Phe Ser Leu Leu Leu
1 5 10 15

Pro Ala Ala Arg Ala Thr Ser Arg Arg Glu Val Cys Asp Cys Asn Gly
20 25 30

Lys Ser Arg Gln Cys Ile Phe Asp Arg Glu Leu His Arg Gln Thr Gly
35 40 45

Asn Gly Phe Arg Cys Leu Asn Cys Asn Asp Asn Thr Asp Gly Ile His
50 55 60

Cys Glu Lys Cys Lys Asn Gly Phe Tyr Arg His Arg Glu Arg Asp Arg
65 70 75 80

Cys Leu Pro Cys Asn Cys Asn Ser Lys Gly Ser Leu Ser Ala Arg Cys
85 90 95

Asp Asn Ser Gly Arg Cys Ser Cys Lys Pro Gly Val Thr Gly Ala Arg
100 105 110

Cys Asp Arg Cys Leu Pro Gly Phe His Met Leu Thr Asp Ala Gly Cys
115 120 125

Thr Gln Asp Gln Arg Leu Leu Asp Ser Lys Cys Asp Cys Asp Pro Ala
130 135 140

Gly Ile Ala Gly Pro Cys Asp Ala Gly Arg Cys Val Cys Lys Pro Ala
145 150 155 160

Val Thr Gly Glu Arg Cys Asp Arg Cys Arg Ser Gly Tyr Tyr Asn Leu
165 170 175

Asp Gly Gly Asn Pro Glu Gly Cys Thr Gln Cys Phe Cys Tyr Gly His
180 185 190

Ser Ala Ser Cys Arg Ser Ser Ala Glu Tyr Ser Val His Lys Ile Thr
195 200 205

Ser Thr Phe His Gln Asp Val Asp Gly Trp Lys Ala Val Gln Arg Asn
210 215 220

Gly Ser Pro Ala Lys Leu Gln Trp Ser Gln Arg His Gln Asp Val Phe
225 230 235 240

Ser Ser Ala Gln Arg Leu Asp Pro Val Tyr Phe Val Ala Pro Ala Lys
245 250 255

Phe Leu Gly Asn Gln Gln Val Ser Tyr Gly Gln Ser Leu Ser Phe Asp
260 265 270

Tyr Arg Val Asp Arg Gly Gly Arg His Pro Ser Ala His Asp Val Ile
275 280 285

Leu Glu Gly Ala Gly Leu Arg Ile Thr Ala Pro Leu Met Pro Leu Gly
290 295 300

Lys Thr Leu Pro Cys Gly Leu Thr Lys Thr Tyr Thr Phe Arg Leu Asn
305 310 315 320
Glu His Pro Ser Asn Asn Trp Ser Pro Gln Leu Ser Tyr Phe Glu Tyr
325 330 335

Arg Arg Leu Leu Arg Asn Leu Thr Ala Leu Arg Ile Arg Ala Thr Tyr
340 345 350

Gly Glu Tyr Ser Thr Gly Tyr Ile Asp Asn Val Thr Leu Ile Ser Ala
355 360 365

Arg Pro Val Ser Gly Ala Pro Ala Pro Trp Val Glu Gln Cys Ile Cys
370 375 380

Pro Val Gly Tyr Lys Gly Gln Phe Cys Gln Asp Cys Ala Ser Gly Tyr
385 390 395 400

Lys Arg Asp Ser Ala Arg Leu Gly Pro Phe Gly Thr Cys Ile Pro Cys
405 410 415

Asn Cys Gln Gly Gly Gly Ala Cys Asp Pro Asp Thr Gly Asp Cys Tyr
420 425 430

Ser Gly Asp Glu Asn Pro Asp Ile Glu Cys Ala Asp Cys Pro Ile Gly
435 440 445

Phe Tyr Asn Asp Pro His Asp Pro Arg Ser Cys Lys Pro Cys Pro Cys
450 455 460

His Asn Gly Phe Ser Cys Ser Val Met Pro Glu Thr Glu Glu Val Val
465 470 475 480

Cys Asn Asn Cys Pro Pro Gly Val Thr Gly Ala Arg Cys Glu Leu Cys
485 490 495

Ala Asp Gly Tyr Phe Gly Asp Pro Phe Gly Glu His Gly Pro Val Arg
500 505 510

Pro Cys Gln Pro Cys Gln Cys Asn Asn Asn Val Asp Pro Ser Ala Ser
515 520 525

Gly Asn Cys Asp Arg Leu Thr Gly Arg Cys Leu Lys Cys Ile His Asn
530 535 540

Thr Ala Gly Ile Tyr Cys Asp Gln Cys Lys Ala Gly Tyr Phe Gly Asp
545 550 555 560

Pro Leu Ala Pro Asn Pro Ala Asp Lys Cys Arg Ala Cys Asn Cys Asn
565 570 575

Pro Met Gly Ser Glu Pro Val Gly Cys Arg Ser Asp Gly Thr Cys Val
580 585 590

Cys Lys Pro Gly Phe Gly Gly Pro Asn Cys Glu His Gly Ala Phe Ser
595 600 605

Cys Pro Ala Cys Tyr Asn Gln Val Lys Ile Gln Met Asp Gln Phe Met
610 615 620

Gln Gln Leu Gln Arg Met Glu Ala Leu Ile Ser Lys Ala Gln Gly Gly
625 630 635 640

Asp Gly Val Val Pro Asp Thr Glu Leu Glu Gly Arg Met Gln Gln Ala
645 650 655

Glu Gln Ala Leu Gln Asp Ile Leu Arg Asp Ala Gln Ile Ser Glu Gly
660 665 670

Ala Ser Arg Ser Leu Gly Leu Gln Leu Ala Lys Val Arg Ser Gln Glu
675 680 685

Asn Ser Tyr Gln Ser Arg Leu Asp Asp Leu Lys Met Thr Val Glu Arg
690 695 700

Val Arg Ala Leu Gly Ser Gln Tyr Gln Asn Arg Val Arg Asp Thr His
705 710 715 720

Arg Leu Ile Thr Gln Met Gln Leu Ser Leu Ala Glu Ser Glu Ala Ser
725 730 735

Leu Gly Asn Thr Asn Ile Pro Ala Ser Asp His Tyr Val Gly Pro Asn
740 745 750

Gly Phe Lys Ser Leu Ala Gln Glu Ala Thr Arg Leu Ala Glu Ser His
755 760 765

Val Glu Ser Ala Ser Asn Met Glu Gln Leu Thr Arg Glu Thr Glu Asp
770 775 780

Tyr Ser Lys Gln Ala Leu Ser Leu Val Arg Lys Ala Leu His Glu Gly
785 790 795 800

Val Gly Ser Gly Ser Gly Ser Pro Asp Gly Ala Val Val Gln Gly Leu
805 810 815
Val Glu Lys Leu Glu Lys Thr Lys Ser Leu Ala Gln Gln Leu Thr Arg
820 825 830

Glu Ala Thr Gln Ala Glu Ile Glu Ala Asp Arg Ser Tyr Gln His Ser
835 840 845

Leu Arg Leu Leu Asp Ser Val Ser Arg Leu Gln Gly Val Ser Asp Gln
850 855 860

Ser Phe Gln Val Glu Glu Ala Lys Arg Ile Lys Gln Lys Ala Asp Ser
865 870 875 880

Leu Ser Thr Leu Val Thr Arg His Met Asp Glu Phe Lys Arg Thr Gln
885 890 895

Lys Asn Leu Gly Asn Trp Lys Glu Glu Ala Gln Gln Leu Leu Gln Asn
900 905 910

Gly Lys Ser Gly Arg Glu Lys Ser Asp Gln Leu Leu Ser Arg Ala Asn
915 920 925

Leu Ala Lys Ser Arg Ala Gln Glu Ala Leu Ser Met Gly Asn Ala Thr
930 935 940

Phe Tyr Glu Val Glu Ser Ile Leu Lys Asn Leu Arg Glu Phe Asp Leu
945 950 955 960

Gln Val Asp Asn Arg Lys Ala Glu Ala Glu Glu Ala Met Lys Arg Leu
965 970 975

Ser Tyr Ile Ser Gln Lys Val Ser Asp Ala Ser Asp Lys Thr Gln Gln
980 985 990

Ala Glu Arg Ala Leu Gly Ser Ala Ala Ala Asp Ala Gln Arg Ala Lys
995 1000 1005

Asn Gly Ala Gly Glu Ala Leu Glu Ile Ser Ser Glu Ile Glu Gln Glu
1010 1015 1020

Ile Gly Ser Leu Asn Leu Glu Ala Asn Val Thr Ala Asp Gly Ala Leu
1025 1030 1035 1040

Ala Met Glu Lys Gly Leu Ala Ser Leu Lys Ser Glu Met Arg Glu Val
1045 1050 1055

Glu Gly Glu Leu Glu Arg Lys Glu Leu Glu Phe Asp Thr Asn Met Asp
1060 1065 1070

Ala Val Gln Met Val Ile Thr Glu Ala Gln Lys Val Asp Thr Arg Ala
1075 1080 1085

Lys Asn Ala Gly Val Thr Ile Gln Asp Thr Leu Asn Thr Leu Asp Gly
1090 1095 1100

Leu Leu His Leu Met Asp Gln Pro Leu Ser Val Asp Glu Glu Gly Leu
1105 1110 1115 1120

Val Leu Leu Glu Gln Lys Leu Ser Arg Ala Lys Thr Gln Ile Asn Ser
1125 1130 1135

Gln Leu Arg Pro Met Met Ser Glu Leu Glu Glu Arg Ala Arg Gln Gln
1140 1145 1150

Arg Gly His Leu His Leu Leu Glu Thr Ser Ile Asp Gly Ile Leu Ala
1155 1160 1165

Asp Val Lys Asn Leu Glu Asn Ile Arg Asp Asn Leu Pro Pro Gly Cys
1170 1175 1180

Tyr Asn Thr Gln Ala Leu Glu Gln Gln
1185 1190



<210> 15 
<211> 1111 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 15 

Met Pro Ala Leu Trp Leu Gly Cys Cys Leu Cys Phe Ser Leu Leu Leu 
1 5 10 15

Pro Ala Ala Arg Ala Thr Ser Arg Arg Glu Val Cys Asp Cys Asn Gly
20 25 30

Lys Ser Arg Gln Cys Ile Phe Asp Arg Glu Leu His Arg Gln Thr Gly
35 40 45

Asn Gly Phe Arg Cys Leu Asn Cys Asn Asp Asn Thr Asp Gly Ile His
50 55 60

Cys Glu Lys Cys Lys Asn Gly Phe Tyr Arg His Arg Glu Arg Asp Arg
65 70 75 80

Cys Leu Pro Cys Asn Cys Asn Ser Lys Gly Ser Leu Ser Ala Arg Cys
85 90 95

Asp Asn Ser Gly Arg Cys Ser Cys Lys Pro Gly Val Thr Gly Ala Arg
100 105 110

Cys Asp Arg Cys Leu Pro Gly Phe His Met Leu Thr Asp Ala Gly Cys
115 120 125

Thr Gln Asp Gln Arg Leu Leu Asp Ser Lys Cys Asp Cys Asp Pro Ala
130 135 140

Gly Ile Ala Gly Pro Cys Asp Ala Gly Arg Cys Val Cys Lys Pro Ala
145 150 155 160

Val Thr Gly Glu Arg Cys Asp Arg Cys Arg Ser Gly Tyr Tyr Asn Leu
165 170 175

Asp Gly Gly Asn Pro Glu Gly Cys Thr Gln Cys Phe Cys Tyr Gly His
180 185 190

Ser Ala Ser Cys Arg Ser Ser Ala Glu Tyr Ser Val His Lys Ile Thr
195 200 205

Ser Thr Phe His Gln Asp Val Asp Gly Trp Lys Ala Val Gln Arg Asn
210 215 220

Gly Ser Pro Ala Lys Leu Gln Trp Ser Gln Arg His Gln Asp Val Phe
225 230 235 240

Ser Ser Ala Gln Arg Leu Asp Pro Val Tyr Phe Val Ala Pro Ala Lys
245 250 255

Phe Leu Gly Asn Gln Gln Val Ser Tyr Gly Gln Ser Leu Ser Phe Asp
260 265 270

Tyr Arg Val Asp Arg Gly Gly Arg His Pro Ser Ala His Asp Val Ile
275 280 285

Leu Glu Gly Ala Gly Leu Arg Ile Thr Ala Pro Leu Met Pro Leu Gly
290 295 300

Lys Thr Leu Pro Cys Gly Leu Thr Lys Thr Tyr Thr Phe Arg Leu Asn
305 310 315 320

Glu His Pro Ser Asn Asn Trp Ser Pro Gln Leu Ser Tyr Phe Glu Tyr
325 330 335

Arg Arg Leu Leu Arg Asn Leu Thr Ala Leu Arg Ile Arg Ala Thr Tyr
340 345 350

Gly Glu Tyr Ser Thr Gly Tyr Ile Asp Asn Val Thr Leu Ile Ser Ala
355 360 365

Arg Pro Val Ser Gly Ala Pro Ala Pro Trp Val Glu Gln Cys Ile Cys
370 375 380

Pro Val Gly Tyr Lys Gly Gln Phe Cys Gln Asp Cys Ala Ser Gly Tyr
385 390 395 400

Lys Arg Asp Ser Ala Arg Leu Gly Pro Phe Gly Thr Cys Ile Pro Cys
405 410 415

Asn Cys Gln Gly Gly Gly Ala Cys Asp Pro Asp Thr Gly Asp Cys Tyr
420 425 430

Ser Gly Asp Glu Asn Pro Asp Ile Glu Cys Ala Asp Cys Pro Ile Gly
435 440 445

Phe Tyr Asn Asp Pro His Asp Pro Arg Ser Cys Lys Pro Cys Pro Cys
450 455 460

His Asn Gly Phe Ser Cys Ser Val Met Pro Glu Thr Glu Glu Val Val
465 470 475 480

Cys Asn Asn Cys Pro Pro Gly Val Thr Gly Ala Arg Cys Glu Leu Cys
485 490 495

Ala Asp Gly Tyr Phe Gly Asp Pro Phe Gly Glu His Gly Pro Val Arg
500 505 510

Pro Cys Gln Pro Cys Gln Cys Asn Asn Asn Val Asp Pro Ser Ala Ser
515 520 525

Gly Asn Cys Asp Arg Leu Thr Gly Arg Cys Leu Lys Cys Ile His Asn
530 535 540

Thr Ala Gly Ile Tyr Cys Asp Gln Cys Lys Ala Gly Tyr Phe Gly Asp
545 550 555 560

Pro Leu Ala Pro Asn Pro Ala Asp Lys Cys Arg Ala Cys Asn Cys Asn
565 570 575

Pro Met Gly Ser Glu Pro Val Gly Cys Arg Ser Asp Gly Thr Cys Val
580 585 590

Cys Lys Pro Gly Phe Gly Gly Pro Asn Cys Glu His Gly Ala Phe Ser
595 600 605

Cys Pro Ala Cys Tyr Asn Gln Val Lys Ile Gln Met Asp Gln Phe Met
610 615 620

Gln Gln Leu Gln Arg Met Glu Ala Leu Ile Ser Lys Ala Gln Gly Gly
625 630 635 640

Asp Gly Val Val Pro Asp Thr Glu Leu Glu Gly Arg Met Gln Gln Ala
645 650 655

Glu Gln Ala Leu Gln Asp Ile Leu Arg Asp Ala Gln Ile Ser Glu Gly
660 665 670

Ala Ser Arg Ser Leu Gly Leu Gln Leu Ala Lys Val Arg Ser Gln Glu
675 680 685

Asn Ser Tyr Gln Ser Arg Leu Asp Asp Leu Lys Met Thr Val Glu Arg
690 695 700

Val Arg Ala Leu Gly Ser Gln Tyr Gln Asn Arg Val Arg Asp Thr His
705 710 715 720

Arg Leu Ile Thr Gln Met Gln Leu Ser Leu Ala Glu Ser Glu Ala Ser
725 730 735

Leu Gly Asn Thr Asn Ile Pro Ala Ser Asp His Tyr Val Gly Pro Asn
740 745 750

Gly Phe Lys Ser Leu Ala Gln Glu Ala Thr Arg Leu Ala Glu Ser His
755 760 765

Val Glu Ser Ala Ser Asn Met Glu Gln Leu Thr Arg Glu Thr Glu Asp
770 775 780

Tyr Ser Lys Gln Ala Leu Ser Leu Val Arg Lys Ala Leu His Glu Gly
785 790 795 800

Val Gly Ser Gly Ser Gly Ser Pro Asp Gly Ala Val Val Gln Gly Leu
805 810 815

Val Glu Lys Leu Glu Lys Thr Lys Ser Leu Ala Gln Gln Leu Thr Arg
820 825 830

Glu Ala Thr Gln Ala Glu Ile Glu Ala Asp Arg Ser Tyr Gln His Ser
835 840 845

Leu Arg Leu Leu Asp Ser Val Ser Arg Leu Gln Gly Val Ser Asp Gln
850 855 860

Ser Phe Gln Val Glu Glu Ala Lys Arg Ile Lys Gln Lys Ala Asp Ser
865 870 875 880

Leu Ser Thr Leu Val Thr Arg His Met Asp Glu Phe Lys Arg Thr Gln
885 890 895

Lys Asn Leu Gly Asn Trp Lys Glu Glu Ala Gln Gln Leu Leu Gln Asn
900 905 910

Gly Lys Ser Gly Arg Glu Lys Ser Asp Gln Leu Leu Ser Arg Ala Asn
915 920 925

Leu Ala Lys Ser Arg Ala Gln Glu Ala Leu Ser Met Gly Asn Ala Thr
930 935 940

Phe Tyr Glu Val Glu Ser Ile Leu Lys Asn Leu Arg Glu Phe Asp Leu
945 950 955 960

Gln Val Asp Asn Arg Lys Ala Glu Ala Glu Glu Ala Met Lys Arg Leu
965 970 975

Ser Tyr Ile Ser Gln Lys Val Ser Asp Ala Ser Asp Lys Thr Gln Gln
980 985 990

Ala Glu Arg Ala Leu Gly Ser Ala Ala Ala Asp Ala Gln Arg Ala Lys
995 1000 1005

Asn Gly Ala Gly Glu Ala Leu Glu Ile Ser Ser Glu Ile Glu Gln Glu
1010 1015 1020

Ile Gly Ser Leu Asn Leu Glu Ala Asn Val Thr Ala Asp Gly Ala Leu
1025 1030 1035 1040

Ala Met Glu Lys Gly Leu Ala Ser Leu Lys Ser Glu Met Arg Glu Val
1045 1050 1055

Glu Gly Glu Leu Glu Arg Lys Glu Leu Glu Phe Asp Thr Asn Met Asp
1060 1065 1070

Ala Val Gln Met Val Ile Thr Glu Ala Gln Lys Val Asp Thr Arg Ala
1075 1080 1085

Lys Asn Ala Gly Val Thr Ile Gln Asp Thr Leu Asn Thr Leu Asp Gly
1090 1095 1100

Leu Leu His Leu Met Gly Met
1105 1110

<210> 16 
<211> 5622 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 

<400> 16 

atggcagcgc cacgccccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccag tgaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 480 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 

tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcatg 840 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcatcccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacatccgg 1200

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 1440

agcgagggct ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 
tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280

tacatgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 2460

ttgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520

cagctgcgcc aggaggtgga ggagaacctg aacgaggtct acaggcagat ctccggtgta 2580

cacaagctcc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gctgaagctt 2700

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760

accctcactg cagaccagga cgcccggggc atggtggagt tccaggaggg cgtggagctg 2820

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2880

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 3240

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcatc 3300 

atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360 

ccccctcacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 3480 

tggattcagg gtgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 3540 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600 

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tcctccacgg tgacccagct gagctgggct 3720 

gagccggctg agaccaacgg tgagatcaca gcctacgagg tctgctatgg cctggtcaac 3780

gatgacaacc gacctattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

atgctgctta ttgagaacct tcgggagtcc cagccctacc gctacacggt gaaggcgcgc 3900 

aacggggccg gctgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccat catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccttat gtacagcgat gacgttctac gctctccatc gggcagccag 4080

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagcccct gctgggggag 4140 

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200 

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 

gcgggcggga agggcggcag cctgccccgc agtgcgacac ccgggccccc cggagagcac 4320 

ctggtgaatg gccggatgga ctttgccttc ccgggcagca ccaactccct gcacaggatg 4380

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4440 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4500 

cactcgacca cactgccgag ggactactcc accctcacct ccgtctcctc ccacggcctc 4560 

cctcccatct gggaacacgg gaggagcagg cttccgctgt cctgggccct ggggtcccgg 4620

agtcgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc tataatcctg 4680

gctgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4740 

gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctag 4800 

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4860 

ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4980 

ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 5040

tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220 

gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280

aacgtgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 5340

gagggcatca tcaccataga gtcccaggat ggaggtccct tcccgcagct gggcagccgt 5400

gccgggctct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 5460

agcgccaccg agcccttcct agtgggtccg accctggggg cccagcacct ggaggcaggc 5520

ggctccctca cccggcatgt gacccaggag tttgtgagcc ggacactgac caccagcgga 5580

acccttagca cccacatgga ccaacagttc ttccaaactt ga 5622
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<400> 17 



Met Ala Gly Pro Arg Pro Ser Pro Trp Ala Arg Leu Leu Leu Ala Ala 

1 5 10 15 

Leu Ile Ser Val Ser Leu Ser Gly Thr Leu Asn Arg Cys Lys Lys Ala 

20 25 30 

Pro Ile Lys Ser Cys Thr Glu Cys Val Arg Val Asp Lys Asp Cys Ala 

35 40 45 

Tyr Cys Thr Asp Glu Met Phe Arg Asp Arg Arg Cys Asn Thr Gln Ala 

50 55 60 

Glu Leu Leu Ala Ala Gly Cys Gln Arg Glu Ser Ile Val Val Met Glu 

65 70 75 80 

Ser Ser Phe Gln Ile Thr Glu Glu Thr Gln Ile Asp Thr Thr Leu Arg 

85 90 95 

Arg Ser Gln Met Ser Pro Gln Gly Leu Arg Val Arg Leu Arg Pro Gly 

100 105 110 

Glu Glu Arg His Phe Glu Leu Glu Val Phe Glu Pro Leu Glu Ser Pro 

115 120 125 

Val Asp Leu Tyr Ile Leu Met Asp Phe Ser Asn Ser Met Ser Asp Asp 

130 135 140 

Leu Asp Asn Leu Lys Lys Met Gly Gln Asn Leu Ala Arg Val Leu Ser 

145 150 155 160 

Gln Leu Thr Ser Asp Tyr Thr Ile Gly Phe Gly Lys Phe Val Asp Lys 

165 170 175 

Val Ser Val Pro Gln Thr Asp Met Arg Pro Glu Lys Leu Lys Glu Pro 

180 185 190 

Trp Pro Asn Ser Asp Pro Pro Phe Ser Phe Lys Asn Val Ile Ser Leu 

195 200 205 

Thr Glu Asp Val Asp Glu Phe Arg Asn Lys Leu Gln Gly Glu Arg Ile 

210 215 220 

Ser Gly Asn Leu Asp Ala Pro Glu Gly Gly Phe Asp Ala Ile Leu Gln 

225 230 235 240 

Thr Ala Val Cys Thr Arg Asp Ile Gly Trp Arg Pro Asp Ser Thr His 

245 250 255 

Leu Leu Val Phe Ser Thr Glu Ser Ala Phe His Tyr Glu Ala Asp Gly 

260 265 270 

Ala Asn Val Leu Ala Gly Ile Met Ser Arg Asn Asp Glu Arg Cys His 

275 280 285 

Leu Asp Thr Thr Gly Thr Tyr Thr Gln Tyr Arg Thr Gln Asp Tyr Pro 

290 295 300 

Ser Val Pro Thr Leu Val Arg Leu Leu Ala Lys His Asn Ile Ile Pro 

305 310 315 320 

Ile Phe Ala Val Thr Asn Tyr Ser Tyr Ser Tyr Tyr Glu Lys Leu His 

325 330 335 

Thr Tyr Phe Pro Val Ser Ser Leu Gly Val Leu Gln Glu Asp Ser Ser 

340 345 350 

Asn Ile Val Glu Leu Leu Glu Glu Ala Phe Asn Arg Ile Arg Ser Asn 

355 360 365 

Leu Asp Ile Arg Ala Leu Asp Ser Pro Arg Gly Leu Arg Thr Glu Val 

370 375 380 

Thr Ser Lys Met Phe Gln Lys Thr Arg Thr Gly Ser Phe His Ile Arg 

385 390 395 400 

Arg Gly Glu Val Gly Ile Tyr Gln Val Gln Leu Arg Ala Leu Glu His 

405 410 415 

Val Asp Gly Thr His Val Cys Gln Leu Pro Glu Asp Gln Lys Gly Asn 

420 425 430 

Ile His Leu Lys Pro Ser Phe Ser Asp Gly Leu Lys Met Asp Ala Gly 

435 440 445 

Ile Ile Cys Asp Val Cys Thr Cys Glu Leu Gln Lys Glu Val Arg Ser 

450 455 460 

Ala Arg Cys Ser Phe Asn Gly Asp Phe Val Cys Gly Gln Cys Val Cys 

465 470 475 480 

Ser Glu Gly Trp Ser Gly Gln Thr Cys Asn Cys Ser Thr Gly Ser Leu 

485 490 495 

Ser Asp Ile Gln Pro Cys Leu Arg Glu Gly Glu Asp Lys Pro Cys Ser 

500 505 510 

Gly Arg Gly Glu Cys Gln Cys Gly His Cys Val Cys Tyr Gly Glu Gly 

515 520 525 

Arg Tyr Glu Gly Gln Phe Cys Glu Tyr Asp Asn Phe Gln Cys Pro Arg 

530 535 540 

Thr Ser Gly Phe Leu Cys Asn Asp Arg Gly Arg Cys Ser Met Gly Gln 

545 550 555 560 

Cys Val Cys Glu Pro Gly Trp Thr Gly Pro Ser Cys Asp Cys Pro Leu 

565 570 575 

Ser Asn Ala Thr Cys Ile Asp Ser Asn Gly Gly Ile Cys Asn Gly Arg 

580 585 590 

Gly His Cys Glu Cys Gly Arg Cys His Cys His Gln Gln Ser Leu Tyr 

595 600 605 

Thr Asp Thr Ile Cys Glu Ile Asn Tyr Ser Ala Ser Thr Arg Ala Ser 

610 615 620 

Ala Arg Thr Tyr Ala Pro Ala Cys Ser Ala Arg Arg Gly Ala Pro Ala 

625 630 635 640 

Arg Arg Arg Gly Ala Arg Val Arg Asn Ala Thr Ser Arg Ser Arg Trp 

645 650 655 

Trp Thr Ser Leu Arg Glu Ala Arg Arg Trp Trp Cys Ala Ala Pro Ser 

660 665 670 

Gly Thr Arg Met Thr Thr Ala Pro Thr Ala Thr Pro Trp Lys Val Thr 

675 680 685 

Ala Pro Leu Gly Pro Thr Ala Leu Ser Trp Cys Thr Arg Arg Arg Asp 

690 695 700 

Cys Pro Pro Gly Ser Phe Trp Trp Leu Ile Pro Leu Leu Leu Leu Leu 

705 710 715 720 

Leu Pro Leu Leu Ala Leu Leu Leu Leu Leu Cys Trp Lys Tyr Cys Ala 

725 730 735 

Cys Cys Lys Ala Cys Leu Ala Leu Leu Pro Cys Cys Asn Arg Gly His 

740 745 750 

Met Val Gly Phe Lys Glu Asp His Tyr Met Leu Arg Glu Asn Leu Met 

755 760 765 

Ala Ser Asp His Leu Asp Thr Pro Met Leu Arg Ser Gly Asn Leu Lys 

770 775 780 

Gly Arg Asp Val Val Arg Trp Lys Val Thr Asn Asn Met Gln Arg Pro 

785 790 795 800 

Gly Phe Ala Thr His Ala Ala Ser Ile Asn Pro Thr Glu Leu Val Pro 

805 810 815 

Tyr Gly Leu Ser Leu Arg Leu Ala Arg Leu Cys Thr Glu Asn Leu Leu 

820 825 830 

Lys Pro Asp Thr Arg Glu Cys Ala Gln Leu Arg Gln Glu Val Glu Glu 

835 840 845 

Asn Leu Asn Glu Val Tyr Arg Gln Ile Ser Gly Val His Lys Leu Gln 

850 855 860 

Gln Thr Lys Phe Arg Gln Gln Pro Asn Ala Gly Lys Lys Gln Asp His 

865 870 875 880 

Thr Ile Val Asp Thr Val Leu Met Ala Pro Arg Ser Ala Lys Pro Ala 

885 890 895 

Leu Leu Lys Leu Thr Glu Lys Gln Val Glu Gln Arg Ala Phe His Asp 

900 905 910 

Leu Lys Val Ala Pro Gly Tyr Tyr Thr Leu Thr Ala Asp Gln Asp Ala 

915 920 925 






Arg Gly Met Val Glu Phe Gln Glu Gly Val Glu Leu Val Asp Val Arg 

930 935 940 

Val Pro Leu Phe Ile Arg Pro Glu Asp Asp Asp Glu Lys Gln Leu Leu 
945 950 955 960 

Val Glu Ala Ile Asp Val Pro Ala Gly Thr Ala Thr Leu Gly Arg Arg 

965 970 975 

Leu Val Asn Ile Thr Ile Ile Lys Glu Gln Ala Arg Asp Val Val Ser 

980 985 990 

Phe Glu Gln Pro Glu Phe Ser Val Ser Arg Gly Asp Gln Val Ala Arg 

995 1000 1005 

Ile Pro Val Ile Arg Arg Val Leu Asp Gly Gly Lys Ser Gln Val Ser 

1010 1015 1020 

Tyr Arg Thr Gln Asp Gly Thr Ala Gln Gly Asn Arg Asp Tyr Ile Pro 
1025 1030 1035 1040 

Val Glu Gly Glu Leu Leu Phe Gln Pro Gly Glu Ala Trp Lys Glu Leu 

1045 1050 1055 

Gln Val Lys Leu Leu Glu Leu Gln Glu Val Asp Ser Leu Leu Arg Gly 

1060 1065 1070 

Arg Gln Val Arg Arg Phe His Val Gln Leu Ser Asn Pro Lys Phe Gly 

1075 1080 1085 

Ala His Leu Gly Gln Pro His Ser Thr Thr Ile Ile Ile Arg Asp Pro 

1090 1095 1100 

Asp Glu Leu Asp Arg Ser Phe Thr Ser Gln Met Leu Ser Ser Gln Pro 
1105 1110 1115 1120 

Pro Pro His Gly Asp Leu Gly Ala Pro Gln Asn Pro Asn Ala Lys Ala 

1125 1130 1135 

Ala Gly Ser Arg Lys Ile His Phe Asn Trp Leu Pro Pro Ser Gly Lys 

1140 1145 1150 

Pro Met Gly Tyr Arg Val Lys Tyr Trp Ile Gln Gly Asp Ser Glu Ser 

1155 1160 1165 

Glu Ala His Leu Leu Asp Ser Lys Val Pro Ser Val Glu Leu Thr Asn 

1170 1175 1180 

Leu Tyr Pro Tyr Cys Asp Tyr Glu Met Lys Val Cys Ala Tyr Gly Ala 
1185 1190 1195 1200 

Gln Gly Glu Gly Pro Tyr Ser Ser Leu Val Ser Cys Arg Thr His Gln 

1205 1210 1215 

Glu Val Pro Ser Glu Pro Gly Arg Leu Ala Phe Asn Val Val Ser Ser 

1220 1225 1230 

Thr Val Thr Gln Leu Ser Trp Ala Glu Pro Ala Glu Thr Asn Gly Glu 

1235 1240 1245 

Ile Thr Ala Tyr Glu Val Cys Tyr Gly Leu Val Asn Asp Asp Asn Arg 

1250 1255 1260 

Pro Ile Gly Pro Met Lys Lys Val Leu Val Asp Asn Pro Lys Asn Arg 
1265 1270 1275 1280 

Met Leu Leu Ile Glu Asn Leu Arg Glu Ser Gln Pro Tyr Arg Tyr Thr 

1285 1290 1295 

Val Lys Ala Arg Asn Gly Ala Gly Trp Gly Pro Glu Arg Glu Ala Ile 

1300 1305 1310 

Ile Asn Leu Ala Thr Gln Pro Lys Arg Pro Met Ser Ile Pro Ile Ile 

1315 1320 1325 

Pro Asp Ile Pro Ile Val Asp Ala Gln Ser Gly Glu Asp Tyr Asp Ser 

1330 1335 1340 

Phe Leu Met Tyr Ser Asp Asp Val Leu Arg Ser Pro Ser Gly Ser Gln 
1345 1350 1355 1360 

Arg Pro Ser Val Ser Asp Asp Thr Gly Cys Gly Trp Lys Phe Glu Pro 

1365 1370 1375 

Leu Leu Gly Glu Glu Leu Asp Leu Arg Arg Val Thr Trp Arg Leu Pro 

1380 1385 1390 

Pro Glu Leu Ile Pro Arg Leu Ser Ala Ser Ser Gly Arg Ser Ser Asp 

1395 1400 1405 

Ala Glu Ala Pro His Gly Pro Pro Asp Asp Gly Gly Ala Gly Gly Lys 
1410 1415 1420 
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Gly Gly Ser Leu Pro Arg Ser Ala Thr Pro Gly Pro Pro Gly Glu His 
1425 1430 1435 1440 

Leu Val Asn Gly Arg Met Asp Phe Ala Phe Pro Gly Ser Thr Asn Ser 

1445 1450 1455 

Leu His Arg Met Thr Thr Thr Ser Ala Ala Ala Tyr Gly Thr His Leu 

1460 1465 1470 

Ser Pro His Val Pro His Arg Val Leu Ser Thr Ser Ser Thr Leu Thr 

1475 1480 1485 

Arg Asp Tyr Asn Ser Leu Thr Arg Ser Glu His Ser His Ser Thr Thr 

1490 1495 1500 

Leu Pro Arg Asp Tyr Ser Thr Leu Thr Ser Val Ser Ser His Gly Leu 
1505 1510 1515 1520 

Pro Pro Ile Trp Glu His Gly Arg Ser Arg Leu Pro Leu Ser Trp Ala 

1525 1530 1535 

Leu Gly Ser Arg Ser Arg Ala Gln Met Lys Gly Phe Pro Pro Ser Arg 

1540 1545 1550 

Gly Pro Arg Asp Ser Ile Ile Leu Ala Gly Arg Pro Ala Ala Pro Ser 

1555 1560 1565 

Trp Gly Pro Asp Ser Arg Leu Thr Ala Gly Val Pro Asp Thr Pro Thr 

1570 1575 1580 

Arg Leu Val Phe Ser Ala Leu Gly Pro Thr Ser Leu Arg Val Ser Trp 
1585 1590 1595 1600 

Gln Glu Pro Arg Cys Glu Arg Pro Leu Gln Gly Tyr Ser Val Glu Tyr 

1605 1610 1615 

Gln Leu Leu Asn Gly Gly Glu Leu His Arg Leu Asn Ile Pro Asn Pro 

1620 1625 1630 

Ala Gln Thr Ser Val Val Val Glu Asp Leu Leu Pro Asn His Ser Tyr 

1635 1640 1645 

Val Phe Arg Val Arg Ala Gln Ser Gln Glu Gly Trp Gly Arg Glu Arg 

1650 1655 1660 

Glu Gly Val Ile Thr Ile Glu Ser Gln Val His Pro Gln Ser Pro Leu 
1665 1670 1675 1680 

Cys Pro Leu Pro Gly Ser Ala Phe Thr Leu Ser Thr Pro Ser Ala Pro 

1685 1690 1695 

Gly Pro Leu Val Phe Thr Ala Leu Ser Pro Asp Ser Leu Gln Leu Ser 

1700 1705 1710 

Trp Glu Arg Pro Arg Arg Pro Asn Gly Asp Ile Val Gly Tyr Leu Val 

1715 1720 1725 

Thr Cys Glu Met Ala Gln Gly Gly Gly Pro Ala Thr Ala Phe Arg Val 

1730 1735 1740 

Asp Gly Asp Ser Pro Glu Ser Arg Leu Thr Val Pro Gly Leu Ser Glu 
1745 1750 1755 1760 

Asn Val Pro Tyr Lys Phe Lys Val Gln Ala Arg Thr Thr Glu Gly Phe 

1765 1770 1775 

Gly Pro Glu Arg Glu Gly Ile Ile Thr Ile Glu Ser Gln Asp Gly Gly 

1780 1785 1790 

Pro Phe Pro Gln Leu Gly Ser Arg Ala Gly Leu Phe Gln His Pro Leu 

1795 1800 1805 

Gln Ser Glu Tyr Ser Ser Ile Ser Thr Thr His Thr Ser Ala Thr Glu 

1810 1815 1820 

Pro Phe Leu Val Gly Pro Thr Leu Gly Ala Gln His Leu Glu Ala Gly 
1825 1830 1835 1840 

Gly Ser Leu Thr Arg His Val Thr Gln Glu Phe Val Ser Arg Thr Leu 

1845 1850 1855 

Thr Thr Ser Gly Thr Leu Ser Thr His Met Asp Gln Gln Phe Phe Gln 

1860 1865 1870 

Thr 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; Note = 
synthetic construct 



<400> 18 

atggcagggc cacgccccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccaa taaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 480 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 

tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcatg 840 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcatcccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacatccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctgccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 1440 

agcgagggct ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 

tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280 

tacatgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340 

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400 

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 2460 

ttgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520 

cagctgcgcc aggaggtgga ggagaacctg aacgaggtct acaggcagat ctccggtgta 2580 

cacaagctcc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gctgaagctt 2700 

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760 

accctcactg cagaccagga cgcccggggc atggtggagt tccaggaggg cgtggagctg 2820 

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2880 

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 3240 

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcatc 3300 
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atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360 

ccccctcacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 3480 

tggattcagg gtgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 3540 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600 

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tcctccacgg tgacccagct gagctgggct 3720 

gagccggctg agaccaacgg tgagatcaca gcctacgagg tctgctatgg cctggtcaac 3780 

gatgacaacc gacctattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

atgctgctta ttgagaacct tcgggagtcc cagccctacc gctacacggt gaaggcgcgc 3900 

aacggggccg gctgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccat catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccttat gtacagcgat gacgttctac gctctccatc gggcagccag 4080 

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagcccct gctgggggag 4140 

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200 

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 

gcgggcggga agggcggcag cctgccccgc agtgcgacac ccgggccccc cggagagcac 4320 

ctggtgaatg gccggatgga ctttgccttc ccgggcagca ccaactccct gcacaggatg 4380 

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4440 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4500 

cactcgacca cactgccgag ggactactcc accctcacct ccgtctcctc ccacggcctc 4560 

cctcccatct gggaacacgg gaggagcagg cttccgctgt cctgggccct ggggtcccgg 4620 

agtcgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc tataatcctg 4680 

gctgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4740 

gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctgg 4800 

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4860 

ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4980 

ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 5040 

tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220 

gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280 

aacgtgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 5340 

gagggcatca tcaccataga gtcccaggat ggaggtccct tcccgcagct gggcagccgt 5400 

gccgggctct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 5460 

agcgccaccg agcccttcct agtgggtccg accctggggg cccagcacct ggaggcaggc 5520 

ggctccctca cccggcatgt gacccaggag tttgtgagcc ggacactgac caccagcgga 5580 

acccttagca cccacatgga ccaacagttc ttccaaactt ga 5622 
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<400> 19 

atggcagggc cacggcccag cccatgggcc aggctgctcc tggcagcctt gatcagcgtc 60 

agcctctctg ggaccttgaa ccgctgcaag aaggccccaa taaagagctg cacggagtgt 120 

gtccgtgtgg ataaggactg cgcctactgc acagacgaga tgttcaggga ccggcgctgc 180 

aacacccagg cggagctgct ggccgcgggc tgccagcggg agagcatcgt ggtcatggag 240 

agcagcttcc aaatcacaga ggagacccag attgacacca ccctgcggcg cagccagatg 300 

tccccccaag gcctgcgggt ccgtctgcgg cccggtgagg agcggcattt tgagctggag 360 

gtgtttgagc cactggagag ccccgtggac ctgtacatcc tcatggactt ctccaactcc 420 

atgtccgatg atctggacaa cctcaagaag atggggcaga acctggctcg ggtcctgagc 480 

cagctcacca gcgactacac tattggattt ggcaagtttg tggacaaagt cagcgtcccg 540 

cagacggaca tgaggcctga gaagctgaag gagccttggc ccaacagtga cccccccttc 600 
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tccttcaaga acgtcatcag cctgacagaa gatgtggatg agttccggaa taaactgcag 660 

ggagagcgga tctcaggcaa cctggatgct cctgagggcg gcttcgatgc catcctgcag 720 

acagctgtgt gcacgaggga cattggctgg cgcccggaca gcacccacct gctggtcttc 780 

tccaccgagt cagccttcca ctatgaggct gatggcgcca acgtgctggc tggcatcatg 840 

agccgcaacg atgaacggtg ccacctggac accacgggca cctacaccca gtacaggaca 900 

caggactacc cgtcggtgcc caccctggtg cgcctgctcg ccaagcacaa catcatcccc 960 

atctttgctg tcaccaacta ctcctatagc tactacgaga agcttcacac ctatttccct 1020 

gtctcctcac tgggggtgct gcaggaggac tcgtccaaca tcgtggagct gctggaggag 1080 

gccttcaatc ggatccgctc caacctggac atccgggccc tagacagccc ccgaggcctt 1140 

cggacagagg tcacctccaa gatgttccag aagacgagga ctgggtcctt tcacatccgg 1200 

cggggggaag tgggtatata ccaggtgcag ctgcgggccc ttgagcacgt ggatgggacg 1260 

cacgtgtgcc agctcccgga ggaccagaag ggcaacatcc atctgaaacc ttccttctcc 1320 

gacggcctca agatggacgc gggcatcatc tgtgatgtgt gcacctgcga gctgcaaaaa 1380 

gaggtgcggt cagctcgctg cagcttcaac ggagacttcg tgtgcggaca gtgtgtgtgc 1440 

agcgagggct ggagtggcca gacctgcaac tgctccaccg gctctctgag tgacattcag 1500 

ccctgcctgc gggagggcga ggacaagccg tgctccggcc gtggggagtg ccagtgcggg 1560 

cactgtgtgt gctacggcga aggccgctac gagggtcagt tctgcgagta tgacaacttc 1620 

cagtgtcccc gcacttccgg gttcctctgc aatgaccgag gacgctgctc catgggccag 1680 

tgtgtgtgtg agcctggttg gacaggccca agctgtgact gtcccctcag caatgccacc 1740 

tgcatcgaca gcaatggggg catctgtaat ggacgtggcc actgtgagtg tggccgctgc 1800 

cactgccacc agcagtcgct ctacacggac accatctgcg agatcaacta ctcggcgtcc 1860 

acccgggcct ctgcgaggac ctacgctcct gcgtgcagtg ccaggcgtgg ggcaccggcg 1920 

agaagaaggg gcgcacgtgt gaggaatgca acttcaaggt caagatggtg gacgagctta 1980 

agagaggcga ggaggtggtg gtgcgctgct ccttccggga cgaggatgac gactgcacct 2040 

acagctacac catggaaggt gacggcgccc ctgggcccaa cagcactgtc ctggtgcaca 2100 

agaagaaggg actgccctcc gggctccttc tggtggctca tccccctgct cctcctcctc 2160 

ctgccgctcc tggccctgct actgctgcta tgctggaagt actgtgcctg ctgcaaggcc 2220 

tgcctggcac ttctcccgtg ctgcaaccga ggtcacatgg tgggctttaa ggaagaccac 2280 

tacatgctgc gggagaacct gatggcctct gaccacttgg acacgcccat gctgcgcagc 2340 

gggaacctca agggccgtga cgtggtccgc tggaaggtca ccaacaacat gcagcggcct 2400 

ggctttgcca ctcatgccgc cagcatcaac cccacagagc tggtgcccta cgggctgtcc 2460 

ttgcgcctgg cccgcctttg caccgagaac ctgctgaagc ctgacactcg ggagtgcgcc 2520 

cagctgcgcc aggaggtgga ggagaacctg aacgaggtct acaggcagat ctccggtgta 2580 

cacaagctcc agcagaccaa gttccggcag cagcccaatg ccgggaaaaa gcaagaccac 2640 

accattgtgg acacagtgct gatggcgccc cgctcggcca agccggccct gctgaagctt 2700 

acagagaagc aggtggaaca gagggccttc cacgacctca aggtggcccc cggctactac 2760 

accctcactg cagaccagga cgcccggggc atggtggagt tccaggaggg cgtggagctg 2820 

gtggacgtac gggtgcccct ctttatccgg cctgaggatg acgacgagaa gcagctgctg 2880 

gtggaggcca tcgacgtgcc cgcaggcact gccaccctcg gccgccgcct ggtaaacatc 2940 

accatcatca aggagcaagc cagagacgtg gtgtcctttg agcagcctga gttctcggtc 3000 

agccgcgggg accaggtggc ccgcatccct gtcatccggc gtgtcctgga cggcgggaag 3060 

tcccaggtct cctaccgcac acaggatggc accgcgcagg gcaaccggga ctacatcccc 3120 

gtggagggtg agctgctgtt ccagcctggg gaggcctgga aagagctgca ggtgaagctc 3180 

ctggagctgc aagaagttga ctccctcctg cggggccgcc aggtccgccg tttccacgtc 3240 

cagctcagca accctaagtt tggggcccac ctgggccagc cccactccac caccatcatc 3300 

atcagggacc cagatgaact ggaccggagc ttcacgagtc agatgttgtc atcacagcca 3360 

ccccctcacg gcgacctggg cgccccgcag aaccccaatg ctaaggccgc tgggtccagg 3420 

aagatccatt tcaactggct gcccccttct ggcaagccaa tggggtacag ggtaaagtac 3480 

tggattcagg gtgactccga atccgaagcc cacctgctcg acagcaaggt gccctcagtg 3540 

gagctcacca acctgtaccc gtattgcgac tatgagatga aggtgtgcgc ctacggggct 3600 

cagggcgagg gaccctacag ctccctggtg tcctgccgca cccaccagga agtgcccagc 3660 

gagccagggc gtctggcctt caatgtcgtc tcctccacgg tgacccagct gagctgggct 3720 

gagccggctg agaccaacgg tgagatcaca gcctacgagg tctgctatgg cctggtcaac 3780 

gatgacaacc gacctattgg gcccatgaag aaagtgctgg ttgacaaccc taagaaccgg 3840 

atgctgctta ttgagaacct tcgggagtcc cagccctacc gctacacggt gaaggcgcgc 3900 

aacggggccg gctgggggcc tgagcgggag gccatcatca acctggccac ccagcccaag 3960 

aggcccatgt ccatccccat catccctgac atccctatcg tggacgccca gagcggggag 4020 

gactacgaca gcttccttat gtacagcgat gacgttctac cctctccatc gggcagccag 4080 

aggcccagcg tctccgatga cactggctgc ggctggaagt tcgagcccct gctgggggag 4140 

gagctggacc tgcggcgcgt cacgtggcgg ctgcccccgg agctcatccc gcgcctgtcg 4200 

gccagcagcg ggcgctcctc cgacgccgag gcgccccacg ggcccccgga cgacggcggc 4260 

gcgggcggga aeggcugcag cctgccccgc agtgcgacac ccgggccccc cggagagcac 4320 
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ctggtgaatg gccggatgga ctttgccttc ccgggcagca ccaactccct gcacaggatg 4380 

accacgacca gtgctgctgc ctatggcacc cacctgagcc cacacgtgcc ccaccgcgtg 4440 

ctaagcacat cctccaccct cacacgggac tacaactcac tgacccgctc agaacactca 4500 

cactcgacca cactgccgag ggactactcc accctcacct ccgtctcctc ccacggcctc 4560 

cctcccatct gggaacacgg gaggagcagg cttccgctgt cctgggccct ggggtcccgg 4620 

agtcgggctc agatgaaagg gttcccccct tccaggggcc cacgagactc tataatcctg 4680 

gctgggaggc cagcagcgcc ctcctggggc ccagactctc gcctgactgc tggtgtgccc 4740 

gacacgccca cccgcctggt gttctctgcc ctggggccca catctctcag agtgagctgg 4800 

caggagccgc ggtgcgagcg gccgctgcag ggctacagtg tggagtacca gctgctgaac 4860 

ggcggtgagc tgcatcggct caacatcccc aaccctgccc agacctcggt ggtggtggaa 4920 

gacctcctgc ccaaccactc ctacgtgttc cgcgtgcggg cccagagcca ggaaggctgg 4980 

ggccgagagc gtgagggtgt catcaccatt gaatcccagg tgcacccgca gagcccactg 5040 

tgtcccctgc caggctccgc cttcactttg agcactccca gtgccccagg cccgctggtg 5100 

ttcactgccc tgagcccaga ctcgctgcag ctgagctggg agcggccacg gaggcccaat 5160 

ggggatatcg tcggctacct ggtgacctgt gagatggccc aaggaggagg tccagccacc 5220 

gcattccggg tggatggaga cagccccgag agccggctga ccgtgccggg cctcagcgag 5280 

aacgtgccct acaagttcaa ggtgcaggcc aggaccactg agggcttcgg gccagagcgc 5340 

gagggcatca tcaccataga gtcccaggat ggaggtccct tcccgcagct gggcagccgt 5400 

gccgggctct tccagcaccc gctgcaaagc gagtacagca gcatctccac cacccacacc 5460 

agcgccaccg agcccttcct agtgggtccg accctggggg cccagcacct ggaggcaggc 5520 

ggctccctca cccggcatgt gacccaggag tttgtgagcc ggacactgac caccagcgga 5580 

acccttagca cccacatgga ccaacagttc ttccaaactt ga 5622 



